OCR-D
DFG-Koordinierungsprojekt zur Weiterentwicklung von Verfahren der Optical Character Recognition
Pinned Repositories
core
Collection of OCR-related python tools and wrappers from @OCR-D
gt-guidelines
OCR-D guidelines for Ground Truth production
ocrd-webapi-implementation
ocrd-website
ocrd_all
Master repository which includes most other OCR-D repositories as submodules
ocrd_anybaseocr
DFKI Layout Detection for OCR-D
ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
ocrd_segment
OCR-D-compliant page segmentation
ocrd_tesserocr
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
spec
Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)
OCR-D's Repositories
OCR-D/core
Collection of OCR-related python tools and wrappers from @OCR-D
OCR-D/ocrd_all
Master repository which includes most other OCR-D repositories as submodules
OCR-D/ocrd_segment
OCR-D-compliant page segmentation
OCR-D/ocrd_anybaseocr
DFKI Layout Detection for OCR-D
OCR-D/ocrd_tesserocr
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
OCR-D/ocrd-website
OCR-D/spec
Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)
OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
OCR-D/page-to-alto
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
OCR-D/ocrd_kraken
Wrapper for the kraken OCR engine
OCR-D/ocrd_froc
OCR-D/ocrd_keraslm
Simple character-based language model using keras
OCR-D/ocrd_olena
Binarize with Olena/scribo
OCR-D/assets
Test data for testing specs and software in @OCR-D
OCR-D/gt_structure_text
The OCR-D Ground Truth text and structure corpus was created between 2015 -2017. In the years since 2017, this corpus has been further curated and supplemented with metadata where appropriate. The corpus includes page XML files within annotations of the text and structure include.
OCR-D/ocr-d.github.io
Website for OCR-D specs, formats, requirements
OCR-D/ocrd_im6convert
Run ImageMagick with an OCR-D CLI
OCR-D/ocrd_fileformat
OCR-D wrapper for ocr-fileformat
OCR-D/gt_structure_all
OCR-D/quiver-benchmarks
Benchmarking OCR-D workflows in Docker
OCR-D/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
OCR-D/ocrd_olahd_client
OCR-D/quiver-frontend
OCR-D/gt_structure_4_1
The repo gt_structure_4_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/gt_structure_4_2
The repo gt_structure_4_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/gt_structure_4_3
The repo gt_structure_4_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/gt_structure_5_1
The repo gt_structure_5_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/gt_structure_5_2
The repo gt_structure_5_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/gt_structure_5_3
The repo gt_structure_5_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
OCR-D/quiver-mongoapi-local
Middleware for running Quiver locally