Pinned Repositories
dinglehopper
An OCR evaluation tool
eynollah
Document Layout Analysis
mods4pandas
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
neat
Named entity annotation tool
sbb_binarization
Document Image Binarization
sbb_images
Image Annotation Tool and Image Search
sbb_ned
Named Entity Disambiguation and Linking
sbb_ner
Named Entity Recognition
sbb_ocr_postcorrection
Two-Step Approach to OCR Post-Correction
sbb_textline_detection
Detect textlines in document images
IDM 4 Projects's Repositories
qurator-spk/train-calamari-gt4histocr
Train a GT4HistOCR Calamari model
qurator-spk/sbb_page_extractor
qurator-spk/ocrodeg
document image degradation
qurator-spk/page2img
qurator-spk/2021-04-match-ocr-text-vs-gt-text
Match OCR page text to GT page text
qurator-spk/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
qurator-spk/ocrd_butler
A butler is a domestic worker in a large household. The butler, as the senior servant, has the highest servant status. He can also sometimes function as a chauffeur.