natanielruiz
Research Scientist at Google | DreamBooth and Personalization of Generative Models
Google, Boston
natanielruiz's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
huggingface/trl
Train transformer language models with reinforcement learning.
pytorch/captum
Model interpretability and understanding for PyTorch
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
facebookresearch/deit
Official DeiT repository
openai/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
openai/improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Models
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
gnobitab/InstaFlow
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
google/dreambooth
jacobgil/vit-explain
Explainability for Vision Transformers
cognitivecomputations/github2file
bcmi/libcom
Image composition toolbox: everything you want to know about image composition or object insertion
mkshing/ziplora-pytorch
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
thuanz123/realfill
Unofficial implementation of RealFill
kpandey008/DiffuseVAE
Official implementation of "DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents"
mkshing/e4t-diffusion
Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models
sayakpaul/probing-vits
Probing the representations of Vision Transformers.
akashsengupta1997/HierarchicalProbabilistic3DHuman
Code repository for the paper: Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation from Images in the Wild (ICCV 2021)
Muzammal-Naseer/IPViT
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021--Spotlight)
mkshing/prompt-plus-pytorch
Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation
NVlabs/High-res-disentanglement-datasets
Datasets for new state-of-the-art challenge in disentanglement learning
nottombrown/imagenet-stubs
A teeny tiny set of ImageNet-like images for testing pipelines