XuesongYang's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), GANs (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
state-spaces/mamba
Mamba SSM architecture
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
soumith/ganhacks
Starter from "How to Train a GAN?" at NIPS 2016
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
tlkh/asitop
Perf monitoring CLI tool for Apple Silicon
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
mozillazg/pinyin-data
Pinyin data for Chinese characters
chq1155/A-Survey-on-Generative-Diffusion-Model
google/visqol
Perceptual Quality Estimator for speech and audio
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
auspicious3000/contentvec
speech self-supervised representations
Newbeeer/pfgmpp
Code for ICML 2023 paper, "PFGM++: Unlocking the Potential of Physics-Inspired Generative Models"
NVIDIA/CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Srijith-rkr/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
NVIDIA/audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
NVIDIA/NeMo-speech-data-processor
A toolkit for processing speech data and creating speech datasets
NVIDIA/NeMo-Run
A tool to configure, launch and manage your machine learning experiments.
sevagh/audio-degradation-toolbox
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
paarthneekhara/NeMo
NeMo: a toolkit for conversational AI
blisc/NeMo
Neural Modules: a toolkit for conversational AI
SungFeng-Huang/NeMo
NeMo: a toolkit for conversational AI