Pinned Repositories
cor-asv-ann
OCR-D post-correction with encoder-attention-decoder LSTMs
nmalign
forced alignment of lists of strings by fuzzy string matching
ocrd_detectron2
OCR-D wrapper for detectron2-based segmentation models
ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
page_dewarp
Text page dewarping using a "cubic sheet" model
workflow-configuration
a makefilization for OCR-D workflows, with configuration examples
ocrd_cis
OCR-D python tools
ocrd_all
Master repository which includes most other OCR-D repositories as submodules
ocrd_keraslm
Simple character-based language model using keras
ocrd_tesserocr
Run tesseract with the tesserocr bindings with @OCR-D's interfaces
bertsky's Repositories
bertsky/ocrd_publaynet
convert PubLayNet data into METS/PAGE-XML
bertsky/page_dewarp
Text page dewarping using a "cubic sheet" model
bertsky/board
ALTO board meeting minutes, agendas, and miscellaneous business
bertsky/docs
OCR-D Documentation
bertsky/hfst
Helsinki Finite-State Technology (library and application suite)
bertsky/keras
Deep Learning for humans
bertsky/LAREX
A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
bertsky/networkx
Official NetworkX source code repository.
bertsky/ocr-d.github.io
Website for OCR-D specs, formats, requirements
bertsky/ocrd_im6convert
bertsky/ocrd_repair_inconsistencies
Automatically fix PAGE-XML order inconsistencies in regions, lines and words
bertsky/olena
Fork of the project with patches by @bertsky
bertsky/Omniscribe
bertsky/PAGE-XML
PAGE XML format collection for document image page content and more
bertsky/pyleptonica
Automatically exported from code.google.com/p/pylepthonica
bertsky/seq2seq
Sequence to Sequence Learning with Keras
bertsky/tensorflow
Computation using data flow graphs for scalable machine learning
bertsky/transkribus-page2page
This repository stores the stylesheet and workaround for transforming the proprietary PAGE XML files from Transkribus (https://transkribus.eu/Transkribus) into a valid PAGE XML format (https://www.primaresearch.org/schema/PAGE/gts/pagecontent/, newest version from 2019-07-16)