ocr-d
There are 97 repositories under the ocr-d topic.
UB-Mannheim/tesseract
Tesseract Open Source OCR Engine (main repository)
UB-Mannheim/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
OCR-D/core
Collection of OCR-related Python tools and wrappers from @OCR-D
OCR-D/ocrd_all
Master repository which includes most other OCR-D repositories as submodules
OCR-D/ocrd_segment
OCR-D-compliant page segmentation
qurator-spk/dinglehopper
An OCR evaluation tool
OCR-D/ocrd_anybaseocr
DFKI Layout Detection for OCR-D
OCR-D/ocrd_tesserocr
Run Tesseract with the tesserocr bindings with @OCR-D's interfaces
cisocrgroup/ocrd_cis
OCR-D Python tools
hnesk/browse-ocrd
An extensible viewer for OCR-D mets.xml files
OCR-D/spec
Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)
bertsky/ocrd_detectron2
OCR-D wrapper for detectron2-based segmentation models
ASVLeipzig/cor-asv-ann
OCR-D post-correction with encoder-attention-decoder LSTMs
OCR-D/ocrd_calamari
Recognize text using Calamari OCR and the OCR-D framework
OCR-D/page-to-alto
Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)
OCR-D/format-converters
Converters for various file formats used for representing OCR
ASVLeipzig/cor-asv-fst
OCR-D post-correction module based on weighted finite-state transducers
OCR-D/ocrd_kraken
Wrapper for the kraken OCR engine
bertsky/ocrd_publaynet
Convert PubLayNet data into METS/PAGE-XML
bertsky/workflow-configuration
A makefilization for OCR-D workflows, with configuration examples
slub/ocrd_manager
Frontend for ocrd_controller and adapter towards ocrd_kitodo
VRI-UFPR/ocrd-gbn
OCR-D-compliant toolset for optical layout recognition on historical German-language documents published in Brazil
ocr-d-modul-2-segmentierung/ocrd-pixelclassifier-segmentation
Wrapper around a pixel classifier
slub/ocrd_kitodo
Docker integration of Kitodo.Production and OCR-D
bertsky/nmalign
Forced alignment of lists of strings by fuzzy string matching
OCR-D/gt-repo-template
A template for creating a ground truth repo with various functions and features, such as metadata creation, data analysis, and presentation.
UB-Mannheim/ocrd_pagetopdf
OCR-D wrapper for prima-pagetopdf
OCR-D/ocrd_keraslm
Simple character-based language model using Keras
qurator-spk/ocrd-galley
A Dockerized test environment for OCR-D processors 🚢
StabiBerlin/ocrd_butler
A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.
bertsky/docstruct
Document structure detection from PAGE-XML to METS-XML
OCR-D/gt-guidelines
OCR-D guidelines for Ground Truth production
OCR-D/ocrd_olena
Binarize with Olena/scribo
OCR-D/ocrd_typegroups_classifier
Font family detection in historical documents.
qurator-spk/train-calamari-gt4histocr
Train a GT4HistOCR Calamari model
slub/ocrd_controller
Path to network implementation of OCR-D