a1004123217

a1004123217's Stars

ming053l/DRCT
Accepted by New Trends in Image Restoration and Enhancement workshop (NTIRE), in conjunction with CVPR 2024.
Language: Jupyter Notebook 16013
geekyutao/Image-Inpainting
A paper summary of image inpainting
Language: Python 811101
PrajitR/fast-pixel-cnn
Speed up PixelCNN++ image generation by up to 183 times
Language: Python 48063
YaoFANGUK/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures. Outputs the subtitle-free, watermark-free image or video file at its original resolution. Runs locally; no third-party API is required.
Language: Python 4.1k 526
QWERDF007/LearningDL
Notes recorded while learning deep learning
Language: Jupyter Notebook 171
Stability-AI/generative-models
Generative Models by Stability AI
Language: Python 24.3k 2.7k
ssut/py-googletrans
(unofficial) Googletrans: Free and Unlimited Google translate API for Python. Translates totally free of charge.
Language: Python 3.9k 717
chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
Language: Python 913
mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 9.9k 959
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python 1.3k 85
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
Language: Jupyter Notebook 1.3k 78
OpenGVLab/video-mamba-suite
The suite of modeling video with Mamba
Language: Python 21921
abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
Language: Python 4310
ianand/spreadsheets-are-all-you-need
1.1k 176
xai-org/grok-1
Grok open release
Language: Python 49.5k 8.3k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 31.7k 4.7k
deepinsight/insightface
State-of-the-art 2D and 3D Face Analysis Project
Language: Python 23k 5.4k
xuehuachunsheng/DupImageDetection
Deduplication algorithm for massive image collections: a local block hash algorithm
Language: Python 9221
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language: Jupyter Notebook 25k 3.2k
wenwenyu/TCM
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
Language: Jupyter Notebook 17414
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
Language: Python 29.2k 9.4k
gaoxiang12/slambook
Language: C++ 6.9k 3.3k
HRNet/Lite-HRNet
This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.
Language: Python 826127
RapidAI/RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
Language: Python 28027
layumi/Person_reID_baseline_pytorch
Pytorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
Language: Python 4.1k 1k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.3k 6.4k
clovaai/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Language: Jupyter Notebook 3.7k 1.1k
milely/OCR_paper
Papers in the field of OCR
9
visionml/pytracking
Visual tracking library based on PyTorch.
Language: Python 3.2k 605
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language: C++ 1.4k 165