1994cxy's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
CompVis/stable-diffusion
A latent text-to-image diffusion model
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
academicpages/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
cocodataset/cocoapi
COCO API - Dataset @ http://cocodataset.org/
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
riffusion/riffusion
Stable diffusion for real-time music generation
mne-tools/mne-python
MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
thu-ml/unidiffuser
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
rmokady/CLIP_prefix_caption
Simple image captioning model
SHI-Labs/Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
XiangLi1999/Diffusion-LM
Diffusion-LM
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
csteinmetz1/pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
ChenFengYe/motion-latent-diffusion
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
HuthLab/semantic-decoding
justinlovelace/latent-diffusion-for-language
torchDDM/DDM
HuthLab/deep-fMRI-dataset
Code accompanying data release of natural language listening fMRI data (LeBel et al.)
GonyRosenman/TFF
An end-to-end framework for analyzing fMRI time-series data (4D) using transformers