hijupiter

hijupiter's Stars

NVlabs/SegFormer
Official PyTorch implementation of SegFormer
Language:Python2.5k349
CSAILVision/ADE20K
ADE20K Dataset
Language:Jupyter Notebook31654
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook47k5.6k
duxiangcheng/SAEN
Modeling Stroke Mask for End-to-End Text Erasing
Language:Python14
rohitgandikota/erasing
Erasing Concepts from Diffusion Models
Language:Python52335
yeungchenwa/Recommendations-Diffusion-Text-Image
A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten generation, scene text recognition and scene text detection.
1914
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
Language:Python51937
CandleLabAI/TPFNet
Language:Python91
rezazad68/BCDUnet_DIBCO
Documnet Image Binarization, DIBCO Challenges
Language:Python4111
ajgallego/document-image-binarization
A selectional auto-encoder approach for document image binarization
Language:Python10123
qurator-spk/eynollah
Document Layout Analysis
Language:Python34029
datawhalechina/leedl-tutorial
《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases
Language:Jupyter Notebook13.3k2.9k
DA-southampton/TRM_tutorial
Transformer在CV和NLP领域的变体模型的从零解读：Transformer；VIT；Swin Transformer
31943
RisabBiswas/T2T-BinFormer
SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network
Language:Python151
dali92002/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022
Language:Jupyter Notebook13833
phamquiluan/jdeskew
ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation
Language:Jupyter Notebook12310
deepdoctection/deepdoctection
A Repo For Document AI
Language:Python2.5k135
attendfov/chinese-layoutlm-v2
中文文档理解多模态语言模型，支持多模态文档信息抽取，文档embedding
910
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30.1k7.4k
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python19.6k2.5k
Helen-Cheung/Baidu-AI-Challenge-Scene-Text-Removal
Language:Python81
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook9.2k1.4k
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.4k165
Zasder3/train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
Language:Python65579
rmokady/CLIP_prefix_caption
Simple image captioning model
Language:Jupyter Notebook1.3k214
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
Language:Python28027
biswassanket/DocSegTr
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
Language:Python579
FeiGeChuanShu/DocTr-ncnn
ncnn demo of (文档矫正)DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Language:C++426
philschmid/document-ai-transformers
Language:Jupyter Notebook31847
shabie/docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
Language:Python25440