document-image-processing

There are 17 repositories under document-image-processing topic.

Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML9.6k 64 1.1k809
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
Language:Python5k 74 150476
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
382 13 429
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Language:Python363 17 3150
hpanwar08/detectron2
Detectron2 for Document Layout Analysis
Language:Python185 8 4763
fh2019ustc/DocScanner
The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”.
Language:Python176 18 920
fh2019ustc/DocGeoNet
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
Language:Python76 6 112
jiangnanboy/Doc-Image-Tool
文档图像处理工具(Document image processing tool)，包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / document_image_dewarping / DocTrimmingEnhancement)。
Language:Python22 1 46
Nomiluks/Handwritting-OCR
Android App for English Handwritten Text Recognition
Language:Java15 3 25
caltechlibrary/documentarist
Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
Language:Python12 7 04
jchazalon/smartdoc15-ch1-pywrapper
Python wrapper to facilitate data manipulation for the SmartDoc 2015 - Challenge 1 Dataset.
Language:Jupyter Notebook6 1 02
Transkribus/competitions
The ScriptNet / competitions site.
Language:Python6 12 1116
tony-xlh/quality-evaluation-of-scanned-document-images
A web app evaluating the quality the scanned document images
Language:HTML3 2 01
ajaycode/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language:HTML2 0 00
jiangnanboy/docimg_tool
复杂背景图像漂白，文字方向矫正，清晰增强，笔记去噪美化，去阴影，扭曲矫正，去黑点以及切边增强。complex background image bleaching, text direction correction, clarity enhancement, note to blur beautification, shadow removal, distortion correction, black spots removal and cutting edge enhancement。
2 1 0
YuanSiping/Similar-Document-Image-Retrieval-Dataset
0 2 03
sfikas/sophia-trikoupi-handwritten-dataset
Sophia Trikoupi dataset (Collection of 46 handwritten, annotated pages)
Language:Python1 0

document-image-processing

Unstructured-IO/unstructured

Layout-Parser/layout-parser

fh2019ustc/Awesome-Document-Image-Rectification

fh2019ustc/DocTr

hpanwar08/detectron2

fh2019ustc/DocScanner

fh2019ustc/DocGeoNet

jiangnanboy/Doc-Image-Tool

Nomiluks/Handwritting-OCR

caltechlibrary/documentarist

jchazalon/smartdoc15-ch1-pywrapper

Transkribus/competitions

tony-xlh/quality-evaluation-of-scanned-document-images

ajaycode/unstructured

jiangnanboy/docimg_tool

YuanSiping/Similar-Document-Image-Retrieval-Dataset

sfikas/sophia-trikoupi-handwritten-dataset