a1004123217's Stars
ming053l/DRCT
Accepted by New Trends in Image Restoration and Enhancement workshop (NTIRE), in conjunction with CVPR 2024.
geekyutao/Image-Inpainting
A paper summary of image inpainting
PrajitR/fast-pixel-cnn
Speed up PixelCNN++ image generation by up to 183 times
YaoFANGUK/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos. Output files keep the original resolution. Runs locally, with no third-party API required.
QWERDF007/LearningDL
A record of some of my deep learning studies
Stability-AI/generative-models
Generative Models by Stability AI
ssut/py-googletrans
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
mlfoundations/open_clip
An open source implementation of CLIP.
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
OpenGVLab/video-mamba-suite
The suite of modeling video with Mamba
abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
ianand/spreadsheets-are-all-you-need
xai-org/grok-1
Grok open release
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
xuehuachunsheng/DupImageDetection
Large-scale image deduplication algorithm: a local block hash algorithm
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
wenwenyu/TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
gaoxiang12/slambook
HRNet/Lite-HRNet
This is an official PyTorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
layumi/Person_reID_baseline_pytorch
PyTorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial: https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
milely/OCR_paper
Papers in the field of OCR
visionml/pytracking
Visual tracking library based on PyTorch.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.