echo-hmwang's Stars
facebookresearch/simsiam
PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566
zhoubolei/introRL
Intro to Reinforcement Learning (强化学习纲要)
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
khawar-islam/diffuseMix
Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
brandontrabucco/da-fusion
Effective Data Augmentation With Diffusion Models
Shark-NLP/DiffuSeq
[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
MishaLaskin/vqvae
A pytorch implementation of the vector quantized variational autoencoder (https://arxiv.org/abs/1711.00937)
karpathy/deep-vector-quantization
VQVAEs, GumbelSoftmaxes and friends
ldeecke/gmm-torch
Gaussian mixture models in PyTorch.
iMeleon/EECS-498-007-598-005-solutions
EECS 498-007 / 598-005 Deep Learning for Computer Vision
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
HigherOrderCO/Bend
A massively parallel, high-level programming language
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
hyunwoongko/transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
Rongjiehuang/TranSpeech
PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
salah-zaiem/augmentations_adaptation
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
light1726/Speech-Tokenization-Papers
This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language modeling.
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
XiangLi1999/PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".