4m4n5's Stars
lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
mlfoundations/open_clip
An open source implementation of CLIP.
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
lucidrains/imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
openai/guided-diffusion
beamandrew/medical-data
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
openai/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
libffcv/ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
sfikas/medical-imaging-datasets
A list of Medical imaging datasets.
willard-yuan/awesome-cbir-papers
📝Awesome and classical image retrieval papers
Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
microsoft/VQ-Diffusion
Official implementation of VQ-Diffusion
Zasder3/train-CLIP
A PyTorch Lightning solution to training OpenAI's CLIP from scratch.
cientgu/VQ-Diffusion
GuyTevet/MotionCLIP
Official Pytorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"
CasualGANPapers/Make-A-Scene
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
valeoai/obow
revantteotia/clip-training
Code to train CLIP model
cpeng93/DiffuseRecon
Towards performant and reliable undersampled MR reconstruction via diffusion model sampling
rahular/itihasa
A large scale Sanskrit-English translation dataset
lgbwust/awesome-image-retrieval-papers
naver-ai/mid.metric
raeidsaqur/mgn
Multimodal Graph Network (MGN): Code repo, examples from the paper
1iyiwei/research-templates
LaTeX templates I created for authoring research papers
cviaai/AF-PLUS
Official MICCAI-2022 submission repository