Levent9's Stars

jaejunL/HYFace
Language:Python4
naver-ai/facetts
Language:Python486
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
Language:Python36531
FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Language: Python · 6.6k · 707
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Language:Python52460
TencentGameMate/chinese_speech_pretrain
Chinese speech pretrained models
Language: Shell · 1k · 87
NVIDIA/audio-flamingo
PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.
Language:Python20314
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Language:Python699118
jishengpeng/ControlSpeech
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec
Language:Python2028
YangAi520/LL-NSPP
Language:Python121
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.
Language:Python37131
yxlu-0102/AP-BWE
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
Language:Python535
YangAi520/LFS-NSPP
Language:Python9
YangAi520/APNet
Language:Python302
YangAi520/NSPP
Language:Python483
Levent9/Zero-shot-FaceVC
Language:Python171
danoneata/xts
being a multi-speaker video-to-speech network
Language:Python72
yxlu-0102/MP-SENet
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Language:Python32646
yangdongchao/InstructTTS
The demo page of InstructTTS
1558
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Language:Jupyter Notebook34055