sumith1896's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
CompVis/stable-diffusion
A latent text-to-image diffusion model
rclone/rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
lllyasviel/ControlNet
Let us control diffusion models!
Stability-AI/generative-models
Generative Models by Stability AI
black-forest-labs/flux
Official inference repo for FLUX.1 models
voxel51/fiftyone
Refine high-quality datasets and visual AI models
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
timothybrooks/instruct-pix2pix
NVlabs/eg3d
state-spaces/s4
Structured state space sequence models
crowsonkb/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
openai/consistencydecoder
Consistency Distilled Diff VAE
MineDojo/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
ShieldMnt/invisible-watermark
Python library for invisible image watermarks (blind image watermark)
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
GaParmar/clean-fid
PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
HTDerekLiu/BlenderToolbox
Some simple Blender scripts for rendering paper figures
iejMac/video2dataset
Easily create large video datasets from video URLs
LAION-AI/aesthetic-predictor
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
google-research/maskgit
Official Jax Implementation of MaskGIT
lucidrains/recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch
certik/fastGPT
Fast GPT-2 inference written in Fortran