zouwei02's Stars
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
magenta/magenta
Magenta: Music and Art Generation with Machine Intelligence
espnet/espnet
End-to-End Speech Processing Toolkit
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
buriburisuri/speech-to-text-wavenet
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Delta-ML/delta
DELTA is a deep learning based natural language and speech processing platform.
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
descriptinc/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
athena-team/athena
an open-source implementation of sequence-to-sequence based speech processing engine
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
didi/athena
A release version for https://github.com/athena-team/athena
LianjiaTech/athena
An open-source implementation of sequence-to-sequence based speech processing engine