Pinned Repositories
bbw
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
malibu
Mannheim library utilities
ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
ocr-gt-tools
Ergonomic line-by-line transcription of scanned text.
PalMA
PalMA Team Monitor
RaiseWikibase
Knowledge graph construction: Fast inserts into a Wikibase instance
spacyopentapioca
A spaCy wrapper of OpenTapioca for named entity linking on Wikidata
tesseract
Tesseract Open Source OCR Engine (main repository)
zotero-ocr
Zotero Plugin for OCR
zotkat
Erweiterung von Zotero für die Katalogisierung
Universitätsbibliothek Mannheim's Repositories
UB-Mannheim/zotkat
Erweiterung von Zotero für die Katalogisierung
UB-Mannheim/AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)
UB-Mannheim/Fibeln
Transkriptionen von Fibeln (19. Jahrhundert)
UB-Mannheim/GTCheck
Check your modified Ground Truth files with visual support!
UB-Mannheim/PagePlus
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
UB-Mannheim/Weisthuemer
Ground truth for Jakob Grimm / Weisthümer
UB-Mannheim/stabi-berlin-gt
Ground truth for digitized publications of Staatsbibliothek zu Berlin
UB-Mannheim/charlottenburger-amtsschrifttum
Werkspezifisches Training Charlottenburger Amtsschrifttum (1879–1919)
UB-Mannheim/guacamole-docker
docker-compose configuration and tools for running Apache Guacamole using Docker
UB-Mannheim/NZZ-black-letter-ground-truth
Ground truth for swiss newspaper "Neue Zürcher Zeitung" (1780–1947)
UB-Mannheim/ocr-model-repo-template
A template for creating an OCR model repository with various functions and features, such as metadata creation and presentation.
UB-Mannheim/Projects
Projects of Mannheim University Library
UB-Mannheim/ocr-model-metadata
Metadata tool for ocr models
UB-Mannheim/Aktienfuehrer-KG
Feedback gathering for the Aktienführer Knowledge Graph
UB-Mannheim/cas2iob
A converter of UIMA CAS XMI files exported from INCEpTION into IOB TSV files with nested NER/NEL tags and components
UB-Mannheim/madata
A tool for syncing the dataset-metadata between MADATA and Wikidata
UB-Mannheim/tesstrain
Train Tesseract LSTM with make
UB-Mannheim/ansible
Ansible is a radically simple IT automation platform that makes your applications and systems easier to deploy and maintain. Automate everything from code deployment to network configuration to cloud management, in a language that approaches plain English, using SSH, with no agents to install on remote systems. https://docs.ansible.com.
UB-Mannheim/ansights
UB-Mannheim/dc-update
Update docker-compose managed services with one command
UB-Mannheim/easydb-documentation
easydb documentation
UB-Mannheim/feed2js
Feed to Javascript
UB-Mannheim/gt-fraktur
UB-Mannheim/masterfiles
Policy masterfiles that are shipped with CFEngine packages
UB-Mannheim/Omeka-plugin-BulkImportFiles
Plugin for Omeka to import files in bulk with their internal metadata (exif, iptc and xmp for images, audio and video, pdf, etc.).
UB-Mannheim/open-web-calendar
Embed a highly customizable web calendar into your website using ICal source links
UB-Mannheim/prima-core-libs
Core libraries by the PRImA Research Lab
UB-Mannheim/quiver-benchmarks
Benchmarking OCR-D workflows in Docker
UB-Mannheim/theme-madataplan
RDMO theme for https://fdz.bib.uni-mannheim.de/madataplan
UB-Mannheim/zotpress
Testing ground for in-progress versions of Zotpress (Zotero + WordPress plugin)