example-gt-repo

This is a mockup: a sample repository for OCR ground truth data.

Primary language: XSLT. License: Creative Commons Zero v1.0 Universal (CC0-1.0).

Example Ground Truth

Organization of folders and files

Collection represents the entire repository. It is a placeholder for the concrete name of the collection.

💡 For example, GermanBooks17thCentury.

The METADATA.yml and LICENSE.md files provide specific information/metadata about the repository.

Images: keep images in the same directory as the textual transcription (e.g. GT-PAGE).

The repository is structured as follows:

├── METADATA.yml
├── LICENSE.md
└── data
    └── document_title or identifier
        ├── GT-PAGE
        ├── GT-ALTO
        └── GT-LINE
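
The layout above could be checked automatically. The following is a minimal sketch, not part of this repository: the function name check_layout is hypothetical, and it assumes that METADATA.yml and LICENSE.md are both required and that each document folder under data should contain at least one of GT-PAGE, GT-ALTO or GT-LINE. The README does not say which of these folders are mandatory, so adjust the rule as needed.

```python
from pathlib import Path

# Names taken from the tree above; treating both files as required is an assumption.
REQUIRED_FILES = ("METADATA.yml", "LICENSE.md")
GT_FOLDERS = ("GT-PAGE", "GT-ALTO", "GT-LINE")


def check_layout(root):
    """Return a list of problems found; an empty list means the layout matches."""
    root = Path(root)
    problems = []

    # Top-level metadata and license files.
    for name in REQUIRED_FILES:
        if not (root / name).is_file():
            problems.append(f"missing file: {name}")

    # The data directory holds one folder per document.
    data = root / "data"
    if not data.is_dir():
        problems.append("missing directory: data")
        return problems

    # Assumption: every document folder needs at least one GT-* subfolder.
    for doc in sorted(p for p in data.iterdir() if p.is_dir()):
        if not any((doc / gt).is_dir() for gt in GT_FOLDERS):
            problems.append(f"no GT folder in: data/{doc.name}")

    return problems
```

Calling check_layout(".") from the root of a collection returns the list of deviations, which can be printed or used to fail a CI job.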
            
           

METADATA File format

Description coming soon.

GitHub Action workflow

OCR-D-Bagits

Description coming soon.

Ground Truth Analysis

Description coming soon.