UB-Mannheim/ocr-fileformat

Support validation

Closed this issue · 0 comments

kba commented

ALTO is defined by XML schemas; we could use those to add an ocr-validate command.

Does an XML Schema for hOCR make sense? Structurally it would certainly not be hard to write, but namespaces, doctypes and so on could cause trouble.
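To give a feel for what a structural hOCR check could look like without a schema, here is a rough sketch in Python using only the standard library. It is not part of ocr-fileformat. The function name `check_hocr`, the nesting rules in `REQUIRED_ANCESTOR`, and the requirement that every `ocr_*` element carries a `bbox` are assumptions made for illustration, not rules taken from the hOCR specification. It only looks at `class` and `title` attributes, so the element namespace does not matter, but it does need well-formed XHTML input.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical rule set for this sketch: an element with the class on the
# left must have an ancestor carrying the class on the right.
REQUIRED_ANCESTOR = {
    "ocr_carea": "ocr_page",
    "ocr_par": "ocr_page",
    "ocr_line": "ocr_page",
    "ocrx_word": "ocr_line",
}

# Assumed form of the bounding box property inside the title attribute.
BBOX_RE = re.compile(r"\bbbox\s+\d+\s+\d+\s+\d+\s+\d+\b")


def check_hocr(xhtml_text):
    """Return a list of problem descriptions; an empty list means none found."""
    # Raises xml.etree.ElementTree.ParseError if the input is not well-formed XML.
    root = ET.fromstring(xhtml_text)
    problems = []

    def walk(elem, ancestor_classes):
        classes = elem.get("class", "").split()
        ocr_classes = [c for c in classes if c.startswith(("ocr_", "ocrx_"))]
        for cls in ocr_classes:
            needed = REQUIRED_ANCESTOR.get(cls)
            if needed and needed not in ancestor_classes:
                problems.append(f"{cls} is not inside {needed}")
            if not BBOX_RE.search(elem.get("title", "")):
                problems.append(f"{cls} has no bbox in its title attribute")
        # Children see the classes of this element as ancestors.
        for child in elem:
            walk(child, ancestor_classes | set(ocr_classes))

    walk(root, frozenset())
    return problems
```

Because unprefixed attributes are not namespaced, the same check works whether or not the document declares the XHTML namespace. Input that is HTML but not well-formed XML, or that relies on entities defined only in an external DTD, would fail at the parsing step, which is roughly the doctype trouble mentioned above.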