Pinned Repositories
kitodo-presentation
Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
kitodo-production
Kitodo.Production is a workflow management tool for mass digitization and is part of the Kitodo Digital Library Suite.
BibsOnGitHub
Bibliothekarische Organisationen und Personen auf GitHub
cyg2deb
Convert Cygwin packages to Debian packages
OSXvnc
VNC Server for macOS
raspberrypi-documentation
Official documentation for the Raspberry Pi
tesseract
Tesseract Open Source OCR Engine (personal development fork)
ocr-fileformat
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
ocr-gt-tools
Ergonomic line-by-line transcription of scanned text.
PalMA
PalMA Team Monitor
stweil's Repositories
stweil/bigbluebutton
Complete open source web conferencing system.
stweil/langdata_lstm
stweil/openjpeg
Official repository of the OpenJPEG project
stweil/tesserocr
A Python wrapper for the tesseract-ocr API
stweil/17_fontmix_complex
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
stweil/19_frak_simple
This repository provides the Ground Truth data for the OCR-D Quiver back end. This data serves as a basis for benchmarking the performance and accuracy of different OCR-D workflows for different types of input data.
stweil/assets
Test data for testing specs and software in @OCR-D
stweil/dita-ot-docs
DITA Open Toolkit documentation
stweil/escriptorium
Personal fork of https://gitlab.com/scripta/escriptorium
stweil/format-converters
Converters for various file formats used for representing OCR
stweil/gt_structure_1_1
The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
stweil/gt_structure_4_2
The repo gt_structure_4_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
stweil/kraken
OCR engine for all the languages
stweil/langdata
Source training data for Tesseract for lots of languages
stweil/leptonica
Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official GitHub repository for Leptonica is https://github.com/DanBloomberg/leptonica. See the official website for more documentation and recent releases:
stweil/ocrd_anybaseocr
DFKI Layout Detection for OCR-D
stweil/ocrd_core
Collection of OCR-related python tools and wrappers from the OCR-D team
stweil/ocrd_froc
stweil/ocropus4inf
stweil/peerjs
Simple peer-to-peer with WebRTC.
stweil/peerjs-server
Server for PeerJS
stweil/Speedometer
An open source repository for the Speedometer benchmark
stweil/tessdata
stweil/tessdata_best
Best (most accurate) trained LSTM models.
stweil/tessdata_fast
Fast integer versions of trained models
stweil/tesstrain
Train tesseract 4 with make
stweil/ulb-groundtruth-eval-odem-ger
OCR Grountruth ULB VD18 German Fraktur - OCR-D Phase III
stweil/ulb-groundtruth-eval-odem-other
OCR Groundtruth ULB VD18 - OCR-D Phase III
stweil/zotero
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
stweil/zotero-ocr
Zotero Plugin for OCR