lbehringer's Stars
espeak-ng/espeak-ng
eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.
KrishnaDN/x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
audioku/cross-accent-maml-asr
Model-agnostic meta-learning (MAML) implementation for cross-accented ASR
andi611/Mockingjay-Speech-Representation
Official implementation of Mockingjay in PyTorch
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
facebookincubator/AITemplate
AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
mrdbourke/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
espnet/espnet
End-to-End Speech Processing Toolkit
musikalkemist/pytorchforaudio
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.