gxlarson's Stars
gxlarson/RVL-CDIP-OOD
surge-ai/surge-python
Python SDK for Surge AI API
elangovana/nlp-train-test-overlap-detector
sparkfish/shabby-pages
ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.
sparkfish/augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
LaihoE/did-it-spill
Check if you have training samples in your test set
amandacurry/convabuse
doc-analysis/ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
QuickSign/ocrized-text-dataset
Quicksign OCRized Text Dataset (QS-OCR)
EasyTensor/python-client
The official python client for EasyTensor
AyanGadpal/Document-Image-Augmentation
Document Image Augmentation is tool for Augmenting axis align document images
Jacobsolawetz/large-scale-oie