1994cxy

1994cxy's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python137k 1.1k 16.4k27.4k
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language:Jupyter Notebook69k 563 71710.2k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Python39.7k 448 3155.1k
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Language:Python28.7k 252 7.2k3.4k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.8k 325 4053.4k
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript12.9k 92 37644.8k
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language:Jupyter Notebook12.2k 98 3481.6k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python11k 168 8132.5k
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language:Jupyter Notebook7.6k 92 149795
cocodataset/cocoapi
COCO API - Dataset @ http://cocodataset.org/
Language:Jupyter Notebook6.1k 112 5613.8k
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter Notebook5.9k 75 2211.2k
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
Language:Python4k 20 22383
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.8k 31 263347
riffusion/riffusion
Stable diffusion for real-time music generation
Language:Python3.3k 38 93380
mne-tools/mne-python
MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python
Language:Python2.8k 83 4.9k1.3k
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language:Python2.5k 42 110227
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
Language:Python2.3k 44 72184
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.5k 29 94148
thu-ml/unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
Language:Python1.4k 17 3287
rmokady/CLIP_prefix_caption
Simple image captioning model
Language:Jupyter Notebook1.3k 7 81218
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Language:Python1.3k 28 3484
XiangLi1999/Diffusion-LM
Diffusion-LM
Language:Python1.1k 17 71140
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
Language:Jupyter Notebook884 19 4971
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Language:Python664 15 3657
ChenFengYe/motion-latent-diffusion
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
Language:Python610 11 6155
HuthLab/semantic-decoding
Language:Python203 7 836
justinlovelace/latent-diffusion-for-language
Language:Python115 4 1617
torchDDM/DDM
Language:Python71 1 410
HuthLab/deep-fMRI-dataset
Code accompanying data release of natural language listening fMRI data (LeBel et al.)
Language:Python57 5 1110
GonyRosenman/TFF
an end to end framework for analyzing fMRI time-series data (4D) using transformers
Language:Python44 2 713