
Extract text from an ALTO file.

Primary language: Python

This project is no longer supported; please use https://github.com/cneud/alto-tools instead.

alto-ocr-text

Extracts the text from an ALTO file and writes it to stdout.

Usage:

python alto_ocr_text.py <altofile>
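
For readers unfamiliar with the format, the extraction described above can be sketched roughly as follows. This is not the code in alto_ocr_text.py; it is a minimal illustration that assumes the usual ALTO layout, in which each TextLine element contains String elements whose CONTENT attribute holds one word. The function names alto_text and local_name are hypothetical, and hyphenation (HYP elements) is ignored.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an XML namespace prefix such as '{http://...}String'."""
    return tag.rsplit("}", 1)[-1]


def alto_text(source):
    """Return the text of an ALTO document, one output line per TextLine.

    `source` may be a file name or a binary file object.
    Assumption: words are stored in the CONTENT attribute of String elements.
    """
    root = ET.parse(source).getroot()
    lines = []
    for element in root.iter():
        # Match on the local name so any ALTO namespace version is accepted.
        if local_name(element.tag) != "TextLine":
            continue
        words = [
            child.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        lines.append(" ".join(words))
    return "\n".join(lines)
```

Calling print(alto_text("page.xml")), where page.xml is a placeholder file name, would write the text to stdout in the same spirit as the script. Matching on local names rather than a fixed namespace is a design choice made here because ALTO files in the wild declare different schema versions.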