rkshuai's Stars
ggerganov/llama.cpp
LLM inference in C/C++
chenfei-wu/TaskMatrix
amusi/CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects
google/yapf
A formatter for Python files
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
google/prompt-to-prompt
CLUEbenchmark/SuperCLUE
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
vivo-ai-lab/BlueLM
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
XiaohangZhan/deocclusion
Code for our CVPR 2020 work.
Maknee/minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
LinXueyuanStdio/LaTeX_OCR
:gem: Math Formula OCR
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
xiaoyu258/DocProj
Document Rectification and Illumination Correction using a Patch-based CNN
taeho-kil/Document-Image-Dewarping
Document Image Dewarping
MichalBusta/E2E-MLT
E2E-MLT - an Unconstrained End-to-End Method for Multi-Language Scene Text
Rid7/Table-OCR
Recognize tables in images and restore them as Word documents.
ocrbook/ocrinaction
baaivision/CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
gwxie/Dewarping-Document-Image-By-Displacement-Flow-Estimation
Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network
limengyang1992/seq2seq-layout-analysis
End-to-end layout analysis based on seq2seq
BADBADBADBOY/DBnet-lite.pytorch
A PyTorch re-implementation of "Real-time Scene Text Detection with Differentiable Binarization"
open-compass/MMBench
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
lee-man/movenet
Unofficial implementation of MoveNet from Google
JianshuZhang/TreeDecoder
A Tree-Structured Decoder for Image-to-Markup Generation
tommyMessi/waveCorrection
OCR document image deformation correction. A reproduction of Alibaba's OCR deformation correction for crumpled document images.
AtiqueUrRehman/qaida
Large-scale, font-independent printed Urdu text dataset
kailigo/cddod
Project page for "Cross-Domain Document Object Detection: Benchmark Suite and Method, CVPR 2020"
VIStA-H/GPT-4V_Social_Media
GPT-4V(ision) as A Social Media Analysis Engine