/transkribus-extract-textual-tags

Extract textual tags from Transkribus PAGE XML to CSV

Primary LanguagePythonCreative Commons Attribution 4.0 InternationalCC-BY-4.0

transkribus-export-textual-tags

Python module to extract textual tags from Transkribus PAGE XML to CSV.

Creator

This software and sample dataset were created by the University of Basel's Research and Infrastructure Support RISE (rise@unibas.ch) in November 2022.

Quickstart

Export Transkribus PAGE XML documents to a folder and extract the textual tags by running

from transkribus_extract_textual_tags.client import Client

Client.extract_from_dir(dir_path="my/transkribus/document/folder",
                        save_file_path="my/output/file.csv"")

License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.