fourierer

Live with mathematics and computer sciences

AlibabaBei Jing

fourierer's Stars

facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30.7k 388 3.5k7.5k
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Language:Python23.2k 348 1.5k6.3k
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python12.8k 75 2731k
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7.1k 76 217458
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Language:Python6.2k 52 630478
idealo/imagededup
😎 Finding duplicate images made easy!
Language:Python5.2k 64 125459
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python5.1k 49 453386
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python4.7k 38 1.5k432
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language:Jupyter Notebook3.8k 84 3911.1k
aim-uofa/AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Language:Python3.4k 84 546652
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Language:Python2.7k 19 758245
meijieru/crnn.pytorch
Convolutional recurrent network in pytorch
Language:Python2.4k 54 239657
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.3k 39 143259
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
Language:Python2.1k 29 170145
MhLiao/DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Language:Python2.1k 43 365481
kingyiusuen/image-to-latex
Convert images of LaTex math equations into LaTex code.
Language:Python2.1k 19 28314
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language:Python1.6k 30 118101
Sanster/text_renderer
Generate text images for training deep learning ocr model
Language:Python1.4k 43 104385
oh-my-ocr/text_renderer
Language:Python802 9 67161
chineseocr/trocr-chinese
transformers ocr for chinese
Language:Python362 8 5256
TianzhongSong/awesome-SynthText
A curated list of awesome synthetic data for text location and recognition
328 13 063
ViTAE-Transformer/DeepSolo
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting"
Language:Python250 7 6934
wenwenyu/TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
Language:Jupyter Notebook184 13 1916
mlpc-ucsd/TESTR
(CVPR 2022) Text Spotting Transformers
Language:Python179 9 2322
ymy-k/DPText-DETR
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
Language:Python174 9 4222
mxin262/ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
Language:Python72 3 227
j-river/svtr-pytorch
pytorch version of svtr model
Language:Python19 1 63
vincezengqiang/caffe_ocr
主流ocr算法研究实验性的项目，目前实现了CNN+BLSTM+CTC架构
Language:C++1 0 00
vincezengqiang/text_renderer
Generate text images for training deep learning ocr model
Language:Python1 0 00
vincezengqiang/trocr-chinese
transformers ocr for chinese
Language:Python1 0 00

fourierer

fourierer's Stars

facebookresearch/detectron2

junyanz/pytorch-CycleGAN-and-pix2pix

lukas-blecher/LaTeX-OCR

OpenBMB/MiniCPM

OpenGVLab/InternVL

idealo/imagededup

QwenLM/Qwen-VL

InternLM/lmdeploy

clovaai/deep-text-recognition-benchmark

aim-uofa/AdelaiDet

modelscope/swift

meijieru/crnn.pytorch

microsoft/table-transformer

THUDM/CogVLM2

MhLiao/DB

kingyiusuen/image-to-latex

X-PLUG/mPLUG-DocOwl

Sanster/text_renderer

oh-my-ocr/text_renderer

chineseocr/trocr-chinese

TianzhongSong/awesome-SynthText

ViTAE-Transformer/DeepSolo

wenwenyu/TCM

mlpc-ucsd/TESTR

ymy-k/DPText-DETR

mxin262/ESTextSpotter

j-river/svtr-pytorch

vincezengqiang/caffe_ocr

vincezengqiang/text_renderer

vincezengqiang/trocr-chinese