/prima-page-metadata-scanner

PAGE Metadata Scanner is a command-line tool that scans a single PAGE XML file (document layout and text content) and outputs its properties in CSV format.
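As a rough illustration of that kind of scan, here is a minimal Python sketch. It is not the tool's own implementation, and it does not reproduce its actual CSV columns, which are not listed here. The function names (`scan_page_xml`, `to_csv`) and the chosen properties (the `Page` image attributes and a `TextRegion` count) are assumptions made for the example.

```python
import csv
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix so any PAGE schema version matches."""
    return tag.rsplit("}", 1)[-1]


def scan_page_xml(xml_text):
    """Collect a few illustrative properties from one PAGE XML document.

    The property set is an assumption for this sketch, not the real
    tool's output.
    """
    root = ET.fromstring(xml_text)
    props = {"textRegionCount": 0}
    for elem in root.iter():
        name = local_name(elem.tag)
        if name == "Page":
            props["imageFilename"] = elem.get("imageFilename", "")
            props["imageWidth"] = elem.get("imageWidth", "")
            props["imageHeight"] = elem.get("imageHeight", "")
        elif name == "TextRegion":
            props["textRegionCount"] += 1
    return props


def to_csv(props):
    """Render the properties as a header row plus one value row."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    keys = sorted(props)
    writer.writerow(keys)
    writer.writerow([props[k] for k in keys])
    return buf.getvalue()


# Tiny hand-written sample document, for demonstration only.
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="scan.png" imageWidth="100" imageHeight="200">
    <TextRegion id="r1"/>
    <TextRegion id="r2"/>
  </Page>
</PcGts>"""

if __name__ == "__main__":
    print(to_csv(scan_page_xml(SAMPLE)), end="")
```

Matching on local element names rather than a fixed namespace keeps the sketch usable across PAGE schema versions, at the cost of not validating the document against any of them.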

Primary language: HTML. License: Apache License 2.0 (Apache-2.0).
