/docstruct

Document structure detection from PAGE-XML to METS-XML

Primary LanguagePythonApache License 2.0Apache-2.0

docstruct

Document structure detection from PAGE to METS

Provides an OCR-D processor which will parse the input page-level structure (as detected by some OCR-D workflow including preprocessing, layout analysis and OCR) of a document annotated via PAGE-XML and METS-XML, further analyse it (...) and wrap it into a document-level structure in the METS using logical mets:structMap and either …

… for representation.