JKamlah

Pinned Repositories

alto-tools
Python script for performing various operations on ALTO XML files
Language:Python
Apache License
AST-PST-Tablerecognizer
Recognize and extract AST-PST-Tables
Language:Python
AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers
BackgroundSubtractor4OCR
BackgroundSubtractor4OCR
Language:Python
german-newspapers-ocr-model
This repository contains models for historical newspapers.
GTMake
Create git-repo-based ground-truth (GT) line pairs with ease
Language:Python
PagePlus
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
Language:Python
scrape-editorial-board
Scraping editorial board of journals
Language:Python
tesseractXplore
tesseractXplore: an easy-to-use Tesseract GUI with full control
Language:Python

JKamlah's Repositories

JKamlah/AST-PST-Tablerecognizer
Recognize and extract AST-PST-Tables
Language:Python
JKamlah/SlothGen
JKamlah/GTMake
Create git-repo-based ground-truth (GT) line pairs with ease
Language:Python
JKamlah/tessdata_fast_ubma
Fast integer versions of trained LSTM models
JKamlah/tessdata_best_ubma
Best (most accurate) trained LSTM models.
JKamlah/page2tsv
PAGE-XML to TSV
JKamlah/BackgroundSubtractor4OCR
BackgroundSubtractor4OCR
Language:Python
JKamlah/ratocer
RATOCER: Reichsanzeiger table of contents extraction and recognition
Language:Python
JKamlah/RaiseWikibase
RaiseWikibase: Fast inserts into the BERD instance
Language:Python
JKamlah/tessdata_best
Best (most accurate) trained LSTM models.
JKamlah/KivyMD
KivyMD is a collection of Material Design compliant widgets for use with Kivy, a framework for cross-platform, touch-enabled graphical applications. https://youtube.com/c/KivyMD https://twitter.com/KivyMD https://habr.com/ru/users/kivymd https://stackoverflow.com/tags/kivymd
JKamlah/GTCheck
Check your modified Ground Truth files with visual support!
JKamlah/ocrd_contrib_ubma
Helper scripts for OCR-D
Language:Python
JKamlah/symspellpy
Python port of SymSpell
JKamlah/dinglehopper
An OCR evaluation tool
Language:Jupyter Notebook
JKamlah/okralact
A repository for online OCRD training infrastructure.
JKamlah/OCR_web
An OCR (Tesseract) web interface for uploading images. The project is meant as a way to study technologies such as Python, Django, continuous integration, and Celery.
JKamlah/the-book-of-secret-knowledge
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
JKamlah/python-cheatsheet
Comprehensive Python Cheatsheet
JKamlah/nashi
Some bits of javascript to transcribe scanned pages using PageXML
JKamlah/newspapers_regions_and_reading_order
Language:Python
JKamlah/AustrianNewspapers
NewsEye / READ OCR training dataset from Austrian Newspapers
JKamlah/scrape-editorial-board
Scraping editorial board of journals
Language:Python
JKamlah/Wordhunt
Fuzzy Wordhunt
Language:Python
JKamlah/tesstrain
Train Tesseract LSTM with make
JKamlah/spleeter
Deezer source separation library including pretrained models.
JKamlah/hocr-tools
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Language:Python
JKamlah/loc-db
This is the central component of the LOC-DB project.
Language:JavaScript
JKamlah/qt-box-editor
Qt4 editor for tesseract-ocr box files
JKamlah/microservices-using-rabbitmq
Python & Go microservices on Docker, using RabbitMQ for asynchronous IPC
Language:Shell