Pinned Repositories
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
download-musiccaps-dataset
Download the MusicCaps dataset for music captioning
espnet
End-to-End Speech Processing Toolkit
NeMo
NeMo: a toolkit for conversational AI
open_asr_leaderboard
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
slurp
Repository for SLURP paper
speechbrain
A PyTorch-based Speech Toolkit
stevehuang52's Repositories
stevehuang52/download-musiccaps-dataset
Download the MusicCaps dataset for music captioning
stevehuang52/espnet
End-to-End Speech Processing Toolkit
stevehuang52/NeMo
NeMo: a toolkit for conversational AI
stevehuang52/open_asr_leaderboard
stevehuang52/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
stevehuang52/slurp
Repository for SLURP paper
stevehuang52/speechbrain
A PyTorch-based Speech Toolkit