Tobler-Lommatzsch: Altfranzösisches Wörterbuch : Ground truth and Models for OCR

This repository contains:

  • training and evaluation data from transcription of Tobler-Lommatzsch: Altfranzösisches Wörterbuch
  • scripts to generate double columns transcription html
  • OCR models trained and tested using this data for Kraken.

License

Licence Creative Commons
Cette œuvre est mise à disposition selon les termes de la Licence Creative Commons Attribution 4.0 International.

Contribute

If you want to contribute training data or models, you can do so by cloning the repository and sending us a pull request, or by sending an email at thibault.clerice at chartes.psl.eu .

Cite this repository

Thibault Clérice (éd.), Tobler-Lommatzsch: Altfranzösisches Wörterbuch : Ground truth and Models for OCR, Paris: École nationale des chartes (PSL), 2018, https://github.com/PonteIneptique/toebler-ocr.