erfect2020's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
facebookresearch/multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
open-mmlab/Multimodal-GPT
Multimodal-GPT
wangkai930418/awesome-diffusion-categorized
Collection of diffusion model papers categorized by their subareas
Linfeng-Tang/Image-Fusion
Deep Learning-based Image Fusion: A Survey
greenbellpepper/GreenPepper
xinntao/HandyView
Handy image viewer based on PyQt5. Convenient for viewing and comparing :-)
magic-research/bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
Yangzhangcst/Mamba-in-CV
A paper list of some recent Mamba-based CV works.
HXMap/MapQR
[ECCV 2024] This is the official implementation of MapQR, an end-to-end method with an emphasis on enhancing query capabilities for constructing online vectorized maps.
wd1511/Awesome-Diffusion-for-Image-Translation
A collection of papers on Diffusion for Image-to-Image Translation and Style Transfer
wd1511/PDNLA-Net
Unsupervised Deep Exemplar Colorization via Pyramid Dual Non-local Attention (TIP 2023)
Aitical/MCLIR
[AAAI 2024] Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
OpenGVLab/InternLMM
MACderRu/Guide-and-Rescale
Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"
JingWu321/Privacy-preserving-in-FL
erfect2020/ContextualDeblur
ViTDeblur: A Hierarchical Model for Defocus Deblurring (IEEE TCI 2024)
i-xiaohu/bioconda-recipes
Conda recipes for the bioconda channel.
i-xiaohu/CompSeed
Compressive version of BWA-MEM seeding.