XuesongYang

NvidiaSeattle, WA

XuesongYang's Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python57.8k 461 1335.9k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k 424 4.2k6.4k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
27.8k 291 432.3k
state-spaces/mamba
Mamba SSM architecture
Language:Python13.7k 101 5831.2k
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook13.5k 174 5221.9k
soumith/ganhacks
starter from "How to Train a GAN?" at NIPS2016
11.5k 345 691.7k
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language:HTML11.3k 268 49952
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python11.1k 167 8172.5k
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language:Python8.6k 69 186356
tlkh/asitop
Perf monitoring CLI tool for Apple Silicon
Language:Python3.7k 31 56155
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
Language:Python1.9k 56 1k310
mozillazg/pinyin-data
汉字拼音数据
Language:Python1.3k 30 25216
chq1155/A-Survey-on-Generative-Diffusion-Model
925 13 560
google/visqol
Perceptual Quality Estimator for speech and audio
Language:C++720 29 74128
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
635 23 336
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python607 31 4080
sp-uhh/sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Language:Python553 12 6477
auspicious3000/contentvec
speech self-supervised representations
Language:Python476 11 3239
Newbeeer/pfgmpp
Code for ICML 2023 paper, "PFGM++: Unlocking the Potential of Physics-Inspired Generative Models"
Language:Python366 10 1435
NVIDIA/CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
Language:Python301 11 051
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Language:Python288 15 3791
Srijith-rkr/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
Language:Jupyter Notebook241 5 1316
NVIDIA/audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Language:Python216 6 1115
dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
Language:Python168 7 16889
NVIDIA/NeMo-speech-data-processor
A toolkit for processing speech data and creating speech datasets
Language:Python102 5 222
NVIDIA/NeMo-Run
A tool to configure, launch and manage your machine learning experiments.
Language:Python93 9 4226
sevagh/audio-degradation-toolbox
easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox
Language:Python47 2 810
paarthneekhara/NeMo
NeMo: a toolkit for conversational AI
Language:Python6 2 00
blisc/NeMo
Neural Modules: a toolkit for conversational AI
Language:Python3 1 03
SungFeng-Huang/NeMo
NeMo: a toolkit for conversational AI
Language:Python1 0 00

XuesongYang

XuesongYang's Stars

labmlai/annotated_deep_learning_paper_implementations

facebookresearch/fairseq

google-research/tuning_playbook

state-spaces/mamba

neonbjb/tortoise-tts

soumith/ganhacks

diff-usion/Awesome-Diffusion-Models

NVIDIA/Megatron-LM

arogozhnikov/einops

tlkh/asitop

rlworkgroup/garage

mozillazg/pinyin-data

chq1155/A-Survey-on-Generative-Diffusion-Model

google/visqol

EmulationAI/awesome-large-audio-models

yangdongchao/AcademiCodec

sp-uhh/sgmse

auspicious3000/contentvec

Newbeeer/pfgmpp

NVIDIA/CleanUNet

NVIDIA/NeMo-text-processing

Srijith-rkr/Whispering-LLaMA

NVIDIA/audio-flamingo

dynamic-superb/dynamic-superb

NVIDIA/NeMo-speech-data-processor

NVIDIA/NeMo-Run

sevagh/audio-degradation-toolbox

paarthneekhara/NeMo

blisc/NeMo

SungFeng-Huang/NeMo