Pinned Repositories
android-vad
This VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
MarsMeng1994's Repositories
MarsMeng1994/android-vad
This VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
MarsMeng1994/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
MarsMeng1994/kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
MarsMeng1994/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
MarsMeng1994/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
MarsMeng1994/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
MarsMeng1994/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
MarsMeng1994/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
MarsMeng1994/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit