MarsMeng1994

Pinned Repositories

android-vad
This VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
Language:C0 1 00
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell0 0 00
kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
Language:Shell0 0 00
NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python0 0 00
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python0 1 00
SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language:Python0 0 00
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0 00
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++0 1 00
wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
Language:Python0 0 00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python34.9k 342 2.7k4.1k

MarsMeng1994's Repositories

MarsMeng1994/android-vad
This VAD library can process audio in real-time utilizing GMM which helps identify presence of human speech in an audio sample that contains a mixture of speech and noise.
Language:C0 1 00
MarsMeng1994/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language:Shell0 0 00
MarsMeng1994/kaldi_x-vector_aishell
Using Kaldi x-vector method to train speaker recognition model on aishell database.
Language:Shell0 0 00
MarsMeng1994/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python0 0 00
MarsMeng1994/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python0 1 00
MarsMeng1994/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Language:Python0 0 00
MarsMeng1994/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0 00
MarsMeng1994/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++0 1 00
MarsMeng1994/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
Language:Python0 0 00