sumith1896

Insanely passionate about Computer Science.

@black-forest-labsPalo Alto, CA

sumith1896's Stars

openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python79.1k 638 09.5k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python72.8k 515 5k7.9k
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook70.2k 572 72310.4k
rclone/rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
Language:Go49.6k 573 5.7k4.4k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python40.6k 453 3225.2k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python40.4k 398 3266.7k
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python31.9k 223 5642.8k
Stability-AI/generative-models
Generative Models by Stability AI
Language:Python25.6k 265 3192.8k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python21.1k 175 2041.5k
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9.3k 65 1.6k608
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python7.7k 56 1971.3k
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook7.3k 55 142488
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python7k 46 88636
timothybrooks/instruct-pix2pix
Language:Python6.6k 70 132553
NVlabs/eg3d
Language:Python3.3k 156 113360
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.6k 52 140314
crowsonkb/k-diffusion
Karras et al. (2022) diffusion models for PyTorch
Language:Python2.4k 42 67384
openai/consistencydecoder
Consistency Distilled Diff VAE
Language:Python2.2k 20 2076
MineDojo/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Language:Java1.9k 29 128172
ShieldMnt/invisible-watermark
python library for invisible image watermark (blind image watermark)
Language:Python1.7k 14 31155
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
Language:Python1.6k 30 27159
shubham-goel/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
Language:Python1.3k 23 161131
GaParmar/clean-fid
PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]
Language:Python1k 8 5272
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Language:Python892 33 3684
HTDerekLiu/BlenderToolbox
Some simple Blender scripts for rendering paper figures
Language:Python712 8 1569
iejMac/video2dataset
Easily create large video dataset from video urls
Language:Python592 9 15670
LAION-AI/aesthetic-predictor
A linear estimator on top of clip to predict the aesthetic quality of pictures
Language:Jupyter Notebook532 13 719
google-research/maskgit
Official Jax Implementation of MaskGIT
Language:Jupyter Notebook499 17 1251
lucidrains/recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch
Language:Python202 11 1714
certik/fastGPT
Fast GPT-2 inference written in Fortran
Language:Fortran194 8 2018

sumith1896

sumith1896's Stars

openai/whisper

comfyanonymous/ComfyUI

CompVis/stable-diffusion

rclone/rclone

Stability-AI/stablediffusion

karpathy/nanoGPT

lllyasviel/ControlNet

Stability-AI/generative-models

black-forest-labs/flux

voxel51/fiftyone

facebookresearch/mae

cloneofsimo/lora

facebookresearch/DiT

timothybrooks/instruct-pix2pix

NVlabs/eg3d

state-spaces/s4

crowsonkb/k-diffusion

openai/consistencydecoder

MineDojo/MineDojo

ShieldMnt/invisible-watermark

NVlabs/edm

shubham-goel/4D-Humans

GaParmar/clean-fid

lucidrains/muse-maskgit-pytorch

HTDerekLiu/BlenderToolbox

iejMac/video2dataset

LAION-AI/aesthetic-predictor

google-research/maskgit

lucidrains/recurrent-interface-network-pytorch

certik/fastGPT