M3ssman's Stars
rust-lang/rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
robotframework/robotframework
Generic automation framework for acceptance testing and RPA
urllib3/urllib3
urllib3 is a user-friendly HTTP client library for Python
LibrePDF/OpenPDF
OpenPDF is a free Java library for creating and editing PDF files, with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. We welcome contributions from other developers. Please feel free to submit pull-requests and bugreports to this GitHub repository.
apache/pdfbox
Mirror of Apache PDFBox
StractOrg/stract
web search done right
UB-Mannheim/ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
cneud/ocr-gt
OCR & Ground Truth Resources
LanguageMachines/PICCL
A set of workflows for corpus building through OCR, post-correction and normalisation
cisocrgroup/PoCoTo
The CIS OCR PostCorrectionTool
artefactual-labs/mets-reader-writer
Library to parse and create METS files, especially for Archivematica.
cgohlke/psdtags
Read and write layered TIFF ImageSourceData and ImageResources tags
OCR-D/format-converters
Converters for various file formats used for representing OCR
ulb-sachsen-anhalt/ulb-zeitungsprojekt-hp1
Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"
OCR-D/ocrd_kraken
Wrapper for the kraken OCR engine
rescribe/tesseract-models
OCR models that can be used with the Tesseract OCR software
da-nrw/DNSCore
The Core System of the DA-NRW Software Suite.
ulb-sachsen-anhalt/digital-eval
Evaluate data from mass digitalization workflows
OCR-D/ocrd-demo-mets-server
slub/kitodo-production-docker
Kitodo.Production Docker
ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-ger
OCR Grountruth ULB VD18 German Fraktur - OCR-D Phase III
ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-lat
OCR Groundtruth ULB VD18 Latin - OCR-D Phase III
ulb-sachsen-anhalt/ulb-groundtruth-eval-odem-other
OCR Groundtruth ULB VD18 - OCR-D Phase III
unt-libraries/pymets
Python module for writing and reading METS records
Deutsche-Digitale-Bibliothek/ddblabs-datapreparationtool
flyingcircusio/gocept.testdb
Create and drop temporary databases for testing purposes.
ulb-sachsen-anhalt/ocrd-odem
OCR Workflows based on OCR-D
JKamlah/PagePlus
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
Digital-Preservation-Finland/dpres-siptools-ng
Library for creating Submission Information Packages (SIP) that comply to the specifications of national digital preservation services of Finland.
ulb-sachsen-anhalt/digital-flow
Little Digitization Workflow Helpers