Pinned Repositories
alto-tools
Python script for performing various operations on ALTO XML files
Apache License
AST-PST-Tablerecognizer
Recognize and extract AST-PST-Tables
AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers
BackgroundSubtractor4OCR
BackgroundSubtractor4OCR
german-newspapers-ocr-model
This repository contains models for historical newspapers.
GTMake
Create git-repository-based ground truth (GT) line pairs with ease
PagePlus
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
scrape-editorial-board
Scraping editorial board of journals
tesseractXplore
tesseractXplore: an easy-to-use Tesseract GUI with full control
JKamlah's Repositories
JKamlah/AST-PST-Tablerecognizer
Recognize and extract AST-PST-Tables
JKamlah/SlothGen
JKamlah/GTMake
Create git-repository-based ground truth (GT) line pairs with ease
JKamlah/tessdata_fast_ubma
Fast integer versions of trained LSTM models
JKamlah/tessdata_best_ubma
Best (most accurate) trained LSTM models.
JKamlah/page2tsv
PAGE-XML to TSV
JKamlah/BackgroundSubtractor4OCR
BackgroundSubtractor4OCR
JKamlah/ratocer
RATOCER: Reichsanzeiger table of contents extraction and recognition
JKamlah/RaiseWikibase
RaiseWikibase: Fast inserts into the BERD instance
JKamlah/tessdata_best
Best (most accurate) trained LSTM models.
JKamlah/KivyMD
KivyMD is a collection of Material Design compliant widgets for use with Kivy, a framework for cross-platform, touch-enabled graphical applications. https://youtube.com/c/KivyMD https://twitter.com/KivyMD https://habr.com/ru/users/kivymd https://stackoverflow.com/tags/kivymd
JKamlah/GTCheck
Check your modified Ground Truth files with visual support!
JKamlah/ocrd_contrib_ubma
Helper scripts for OCR-D
JKamlah/symspellpy
Python port of SymSpell
JKamlah/dinglehopper
An OCR evaluation tool
JKamlah/okralact
A repository for online OCR-D training infrastructure.
JKamlah/OCR_web
An OCR (Tesseract) web interface for uploading images. The project is meant as a way to study technologies such as Python, Django, continuous integration, and Celery.
JKamlah/the-book-of-secret-knowledge
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
JKamlah/python-cheatsheet
Comprehensive Python Cheatsheet
JKamlah/nashi
Some bits of JavaScript to transcribe scanned pages using PageXML
JKamlah/newspapers_regions_and_reading_order
JKamlah/AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers
JKamlah/scrape-editorial-board
Scraping editorial board of journals
JKamlah/Wordhunt
Fuzzy Wordhunt
JKamlah/tesstrain
Train Tesseract LSTM with make
JKamlah/spleeter
Deezer source separation library including pretrained models.
JKamlah/hocr-tools
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
JKamlah/loc-db
This is the central component of the LOC-DB project.
JKamlah/qt-box-editor
Qt4 editor for tesseract-ocr box files
JKamlah/microservices-using-rabbitmq
Python & Go microservices on Docker, using RabbitMQ for asynchronous IPC