Releases: OCR-D/core

v2.59.0

27 Nov 16:22
@kba kba

Changed:

  • Change web API paths to avoid any potential URL segment clashes, #1136, OCR-D/spec#250
    • GET / -> GET /info
    • POST / -> POST /run
    • /{job-id} -> /job/{job-id}
    • /{job-id}/log -> /log/{job-id}
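
For clients that still build the pre-2.59.0 paths, the renames listed above can be sketched as a small translation helper. The name `migrate_path` is hypothetical and not part of OCR-D; the function only restates the four mappings in this release note and rejects anything else.

```python
def migrate_path(method: str, path: str) -> str:
    """Translate a pre-2.59.0 web API path to its 2.59.0 form.

    Hypothetical helper for illustration; it only encodes the
    four renames listed in the v2.59.0 release note.
    """
    segments = [s for s in path.split("/") if s]
    if not segments:
        # GET / -> GET /info, POST / -> POST /run
        if method.upper() == "GET":
            return "/info"
        if method.upper() == "POST":
            return "/run"
        raise ValueError(f"no mapping for {method} /")
    if len(segments) == 1:
        # /{job-id} -> /job/{job-id}
        return f"/job/{segments[0]}"
    if len(segments) == 2 and segments[1] == "log":
        # /{job-id}/log -> /log/{job-id}
        return f"/log/{segments[0]}"
    raise ValueError(f"no mapping for {method} {path}")
```

Calling `migrate_path("GET", "/abc123/log")` yields `/log/abc123`; paths that do not match one of the old shapes raise `ValueError` rather than being guessed at.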

Fixed:

  • WorkspaceBagger: do not overwrite files in case of filename conflict, #1129, #1137
  • Update apidocs to include ocrd_network, #1131

Added:

  • ocrd workspace update-page to set attributes on the mets:div of a page, #1133, #1134
  • ocrd workspace list-page now has configurable output format and optional partitioning of the page list, #1140, #1141
  • ocrd zip bag, ocrd workspace merge, ocrd workspace clone now support whitelisting/blacklisting file groups, #356, #383, #506, #582, #1138, #1139
  • workflow endpoint supports storing and deduplicating workflows, #1143
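
The whitelisting/blacklisting of file groups mentioned above can be illustrated with a minimal filter. This is a sketch of the general idea only, not the OCR-D implementation; the function name `select_file_groups` and its parameters are hypothetical, and it assumes that an empty whitelist means "keep everything" and that the blacklist is applied afterwards.

```python
from typing import Iterable, List, Optional


def select_file_groups(
    available: Iterable[str],
    include: Optional[Iterable[str]] = None,
    exclude: Optional[Iterable[str]] = None,
) -> List[str]:
    """Return the file groups that pass the whitelist and blacklist.

    Hypothetical helper: `include` acts as a whitelist (None or empty
    keeps all groups), `exclude` as a blacklist applied on top of it.
    """
    include_set = set(include) if include else None
    exclude_set = set(exclude or ())
    return [
        group
        for group in available
        if (include_set is None or group in include_set)
        and group not in exclude_set
    ]
```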

Removed:

  • OcrdMets: remove unused __exit__ method, #1130, #1132

v2.58.1

20 Oct 11:04
@kba kba

Fixed:

  • bashlib: regression introduced in v2.58.0 breaking non-mets-server calls, #1128

v2.58.0

20 Oct 09:38
@kba kba

Fixed:

  • helpers.run_cli: Handle both int and str log levels, #1121
  • bashlib: typo ocrd_argv -> ocrd__argv, #1122, #1123
  • processing workers: pass log level as string and call initLogging at the right time; handle logging of bashlib workers separately, #1123, #1127
  • ocrd workspace bulk-add now supports -U/--mets-server-url, #1126
  • bashlib: Support -U as alias for --mets-server-url, #1126
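
Accepting a log level as either an int or a str, as the helpers.run_cli fix above describes, can be sketched with the standard `logging` module. The function `normalize_log_level` is a hypothetical illustration and not the code from #1121; it assumes only the standard level names are valid.

```python
import logging

# Standard level names known to the logging module.
_KNOWN_LEVELS = ("CRITICAL", "ERROR", "WARNING", "INFO", "DEBUG", "NOTSET")


def normalize_log_level(level) -> str:
    """Return the standard level name for an int or str log level.

    Hypothetical helper: ints are mapped via logging.getLevelName,
    strings are upper-cased; anything else raises ValueError.
    """
    if isinstance(level, int):
        name = logging.getLevelName(level)
    else:
        name = str(level).upper()
    if name not in _KNOWN_LEVELS:
        raise ValueError(f"unknown log level: {level!r}")
    return name
```

Returning the name as a string matches the note above that processing workers pass the log level as a string.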

Added:

  • METS server: POST /reload to reload METS from disk, #1123, #1124

v2.57.2

18 Oct 17:31
@kba kba

Fixed:

  • bashlib: remove vestigial --log-filename option from #1105, #1120

v2.57.0

18 Oct 17:31
@kba kba

Fixed:

  • running a processor as a worker no longer dumps ocrd-tool.json or messes up logging, #1116

Changed:

  • logging: With ocrd_logging.conf (e.g. in Docker), log all messages DEBUG and up and log to ocrd.log, #1117

v2.56.0

13 Oct 08:33
@kba kba

Changed:

  • A separate logging dir tree structure for the modules (processing servers, processing workers, processor servers, mets servers, processing jobs). Configurable with env v
    • Processing job-level logging - each job is logged into a separate file with format {job_id}.log
    • Processing job-level logging file paths are added to the Job models and preserved in the database.
    • The ocrd_network logging is based on the format provided in ocrd_utils
  • Support env variable OCRD_NETWORK_SOCKETS_ROOT_DIR for setting the root directory for METS server sockets, #1111
  • An endpoint /job/{id}/log for getting the log file of a processing job of a processor, #1111
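
The per-job log file name and the socket root setting described above can be sketched as follows. Both helper names are hypothetical, and the fallback directory is an assumption made for this example, not a documented OCR-D default; only the variable name OCRD_NETWORK_SOCKETS_ROOT_DIR and the `{job_id}.log` format come from the notes above.

```python
import os
from pathlib import Path
from typing import Mapping


def sockets_root_dir(
    env: Mapping[str, str] = os.environ,
    fallback: str = "/tmp/ocrd_sockets",  # assumed fallback, not an OCR-D default
) -> Path:
    """Resolve the METS server socket root from the environment."""
    return Path(env.get("OCRD_NETWORK_SOCKETS_ROOT_DIR", fallback))


def job_log_file(log_dir: str, job_id: str) -> Path:
    """Build the per-job log path using the {job_id}.log format."""
    return Path(log_dir) / f"{job_id}.log"
```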

v2.55.2

13 Oct 08:33
@kba kba

Fixed:

v2.55.1

12 Oct 10:14
@kba kba

Changed:

  • ocrd workspace bulk-add distinguishes between url and local_filename, supporting both, #1086, #1079, #1113

v2.55.0

11 Oct 11:55
@kba kba

Added:

  • /workflow endpoint that can handle ocrd process workflows and distribute jobs page-wise across workers, #1083, #1105, #1108, #1109

Changed:

  • METS Server: Make sockets world-readable and -writable, #1098, #1099
  • METS Server: Implement find_files support for local_filename and url, #1100
  • Logging: consistent logger names derived from ocrd., #1101
  • Logging: consistent logging across the packages, including ocrd_network, #1101
  • .. page range operator: allow single-page ranges, #1106, #1107
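
To illustrate what a single-page range means for the `..` operator, here is a simplified expansion routine. It is a sketch under stated assumptions, not the OCR-D implementation: the name `expand_page_range` is hypothetical, and it assumes page IDs share a common prefix followed by a zero-padded number, whereas the real tool works with the page IDs present in the METS.

```python
import re
from typing import List


def expand_page_range(expr: str) -> List[str]:
    """Expand 'PREFIXnnn..PREFIXmmm' into the list of page IDs.

    Hypothetical helper: a range whose start equals its end yields
    exactly one page; an expression without '..' is returned as-is.
    """
    if ".." not in expr:
        return [expr]
    start, end = expr.split("..", 1)
    m_start = re.fullmatch(r"(.*?)(\d+)", start)
    m_end = re.fullmatch(r"(.*?)(\d+)", end)
    if not m_start or not m_end or m_start.group(1) != m_end.group(1):
        raise ValueError(f"cannot expand page range: {expr!r}")
    prefix = m_start.group(1)
    width = len(m_start.group(2))  # preserve zero padding of the start ID
    first, last = int(m_start.group(2)), int(m_end.group(2))
    if first > last:
        raise ValueError(f"descending page range: {expr!r}")
    return [f"{prefix}{n:0{width}d}" for n in range(first, last + 1)]
```

With this sketch, `PHYS_0005..PHYS_0005` expands to the single page `PHYS_0005`, which is the case the change above allows.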

v2.54.0

12 Sep 15:30
@kba kba

Added:

  • METS Server: providing concurrent additional access to the METS file for parallel processing, #966
  • Web API: Cache jobs in the processing server with an optional callback once processed, #1069
  • Web API: Lock pages output file groups of a workspace to prevent simultaneous non-additive access to workspaces, #1069
  • Web API: Support job dependency for caching complete fully-deterministic workflows, #1069
  • Web API: Processing server will start all ready requests, not just the first one, #1069
  • Web API: Workers will create on demand, retry attempts configurable via OCRD_NETWORK_WORKER_QUEUE_CONNECT_ATTEMPTS, #1093
  • ocrd_utils.config to collect all configuration based on environment variables in one place, #1081
  • Processor CLI: Warn if a given page ID cannot be found in METS, #1088, #1089

Changed:

  • Processors now have worker and server subcommands, with separate --help, for starting processing worker/processor server, #1087
  • Move tf_disable_interactive_logs (to silence keras/tensorflow print statements) to ocrd_utils.logging and do not call it at module level, #1090, #1091
  • 🔥 We now properly distinguish between original and local-file FLocat; the original URL will no longer be changed for downloads, #323, #1079
  • 🔥 logging has been streamlined to be better usable as a library, #1080