erfect2020

erfect2020's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language: Python 170k 1.5k 3k44.7k
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language: Python 32.8k 318 9454.8k
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook 15.5k 115 3951.4k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
13.3k 259 129846
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10.1k 97 678982
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python 7.5k 57 1931.2k
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Language: Python 3.4k 40 57195
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Language: Python 2.5k 37 71176
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Language: Python 1.5k 21 40146
open-mmlab/Multimodal-GPT
Multimodal-GPT
Language: Python 1.5k 13 20126
wangkai930418/awesome-diffusion-categorized
Collection of diffusion model papers categorized by their subareas
1.4k 59 1966
Linfeng-Tang/Image-Fusion
Deep Learning-based Image Fusion: A Survey
Language: MATLAB 878 9 10136
greenbellpepper/GreenPepper
877 25 1119
xinntao/HandyView
Handy image viewer based on PyQt5. Convenient for viewing and comparing :-)
Language: Python 595 13 1265
magic-research/bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Language: Python 506 10 1935
Yangzhangcst/Mamba-in-CV
A paper list of some recent Mamba-based CV works.
242 5 113
HXMap/MapQR
[ECCV 2024] This is the official implementation of MapQR, an end-to-end method with an emphasis on enhancing query capabilities for constructing online vectorized maps.
Language: Python 168 6 1511
wd1511/Awesome-Diffusion-for-Image-Translation
A collection of papers on Diffusion for Image-to-Image Translation and Style Transfer
Language: Python 148 3 017
wd1511/PDNLA-Net
Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-local Attention (TIP 2023)
Language: Python 37 2 50
Aitical/MCLIR
[AAAI 2024] Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
33 2 43
OpenGVLab/InternLMM
17 1 00
MACderRu/Guide-and-Rescale
Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"
Language: Jupyter Notebook 9 0 00
JingWu321/Privacy-preserving-in-FL
Language: Python 5 1 12
erfect2020/ContextualDeblur
ViTDeblur: A Hierarchical Model for Defocus Deblurring (IEEE TCI 2024)
Language: Python 3 1 30
i-xiaohu/bioconda-recipes
Conda recipes for the bioconda channel.
Language: Shell 2
i-xiaohu/CompSeed
Compressive version of BWA-MEM seeding.
Language: C 2