Pinned Repositories
parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
CLEval
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
MixNet
large-ocr-model.github.io
MTL-TabNet
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
TexTeller
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
unitable
UniTable: Towards a Unified Table Foundation Model
UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
DocAligner
[arXiv 2023] DocAligner: Automating the Annotation of Photographed Documents Through Real-virtual Alignment
lerndeep's Repositories
lerndeep doesn’t have any repository yet.