alto-ocr-text Extracts the text from an ALTO file and writes it to stdout. Use like: python alto_ocr_text.py <altofile>