hijupiter

hijupiter's Stars

rohitgandikota/erasing
Erasing Concepts from Diffusion Models
Language:Python48932
yeungchenwa/Recommendations-Diffusion-Text-Image
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.
1613
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
Language:Python49137
CandleLabAI/TPFNet
Language:Python91
rezazad68/BCDUnet_DIBCO
Documnet Image Binarization, DIBCO Challenges
Language:Python4110
ajgallego/document-image-binarization
A selectional auto-encoder approach for document image binarization
Language:Python10123
qurator-spk/eynollah
Document Layout Analysis
Language:Python32826
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》（李宏毅老师推荐👍），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases
Language:Jupyter Notebook11.2k2.7k
DA-southampton/TRM_tutorial
Transformer在CV和NLP领域的变体模型的从零解读：Transformer；VIT；Swin Transformer
31243
RisabBiswas/T2T-BinFormer
SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network
Language:Python131
dali92002/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
Language:Jupyter Notebook13432
phamquiluan/jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
Language:Jupyter Notebook11010
deepdoctection/deepdoctection
A Repo For Document AI
Language:Python2.4k120
attendfov/chinese-layoutlm-v2
中文文档理解多模态语言模型，支持多模态文档信息抽取，文档embedding
68
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python29.4k7.3k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.2k2.4k
Helen-Cheung/Baidu-AI-Challenge-Scene-Text-Removal
Language:Python81
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook8.6k1.3k
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.2k148
Zasder3/train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
Language:Python63478
rmokady/CLIP_prefix_caption
Simple image captioning model
Language:Jupyter Notebook1.3k213
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
Language:Python25426
biswassanket/DocSegTr
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
Language:Python499
FeiGeChuanShu/DocTr-ncnn
ncnn demo of (文档矫正)DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Language:C++414
philschmid/document-ai-transformers
Language:Jupyter Notebook29140
shabie/docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
Language:Python24940
Sierkinhane/CRNN_Chinese_Characters_Rec
(CRNN) Chinese Characters Recognition.
Language:Python1.8k537
vkgo/OCRAutoScore
OCR自动化阅卷项目
Language:Python14042
AstarLight/Lets_OCR
A repository for OCR, which inlcudes some classical OCR algorithms Pytorch implementation such as CTPN, EAST and CRNN.
Language:C++649329
AstarLight/CPS-OCR-Engine
An awesome OCR engine developed by SYSU DeepDriving Lab
Language:Python1.1k513