Pinned Repositories
parseq
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
CLEval
CLEval: Character-Level Evaluation for Text Detection and Recognition Tasks
MixNet
large-ocr-model.github.io
MTL-TabNet
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
TexTeller
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and superior generalization ability, enabling it to cover most usage scenarios.
unitable
UniTable: Towards a Unified Table Foundation Model
UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
CLIP4STR
An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".
DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
lerndeep's Repositories
lerndeep doesn’t have any repositories yet.