Pinned Repositories
gt-repo-template
A template for creating a ground truth repository with various functions and features, such as metadata creation, data analysis, and presentation.
keyboardGT
A collection of keyboards for transcription software (Aletheia, Transkribus, LAREX, QURATOR-neat, eScriptorium)
AletheiaTools
AletheiaTools is a collection of tools for transforming file formats (PAGE XML) and metadata formats (METS). It is a kind of Swiss Army knife for Ground Truth ;-)
choco-mufin
Tools for normalizing the use of some characters and checking file consistency
digi-gt
Ground truth for the digitized historic collections of UB Mannheim
gt-fraktur
gt-guidelines
OCR-D guidelines for Ground Truth production
gt_corpus_benchmark
This repo provides a collection of ground truth data. The collection was compiled according to different criteria (layout complexity and font usage). The individual data sets are also described by metadata, which is based on the labeling scheme of OCR-D/PrimaLab.
page2page
This repository contains the stylesheet and workaround for transforming the proprietary PAGE XML files from Transkribus (https://transkribus.eu/Transkribus) into valid PAGE XML (https://www.primaresearch.org/schema/PAGE/gts/pagecontent/, newest version from 2019-07-16).
page2tei
tboenig's Repositories
tboenig/choco-mufin
Tools for normalizing the use of some characters and checking file consistency
tboenig/gt-fraktur
tboenig/gt-guidelines
OCR-D guidelines for Ground Truth production
tboenig/reichsanzeiger-gt
Ground truth for German newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–1945)
tboenig/17_fontmix_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
tboenig/18_frak_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
tboenig/19_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
tboenig/CITATIONupdate
tboenig/dtabf_new
DTA Base Format (DTABf)
tboenig/evt-viewer-angular
Edition Visualization Technology version 3
tboenig/German-Brazilian-Newspapers-Dataset_1
The GBN Dataset consists of German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.
tboenig/German-Brazilian-Newspapers-Dataset_2
The GBN Dataset consists of German-Brazilian historical newspapers, along with their digital and binarized images and ground truth files.
tboenig/gt-test
TEST GT
tboenig/gt_structure_1_1_AG_OCR_workshop
tboenig/gt_structure_3_1_debug
The repo gt_structure_3_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
tboenig/gt_structure_text_debug
The OCR-D Ground Truth text and structure corpus was created between 2015 and 2017. Since 2017, this corpus has been further curated and supplemented with metadata where appropriate. The corpus includes PAGE XML files with annotations of the text and structure.
tboenig/gt_structure_text_test
tboenig/gtesty
tboenig/htr-united
Ground Truth Resources for the HTR of patrimonial documents
tboenig/mkn-kurrent-gt
Kurrent GT from the Moravian Knowledge Network handwritten periodicals
tboenig/ocr_trainingsdaten
tboenig/schema
Repository for schema related business
tboenig/sub_modul_test
tboenig/t_guidelines
tboenig/uEdition
A micro framework for building digital editions
tboenig/uEditor
A web-based editor for μEditions
tboenig/ulb-groundtruth-eval-odem-ger
OCR Groundtruth ULB VD18 German Fraktur - OCR-D Phase III
tboenig/ulb-groundtruth-eval-odem-lat
OCR Groundtruth ULB VD18 Latin - OCR-D Phase III
tboenig/ulb-groundtruth-eval-odem-other
tboenig/ulb-groundtruth-eval-odem-other_test
OCR Groundtruth ULB VD18 - OCR-D Phase III