lbehringer

lbehringer's Stars

espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Language:C4.1k872
KrishnaDN/x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
Language:Python10125
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Language:Python588113
audioku/cross-accent-maml-asr
Meta-learning model agnostic (MAML) implementation for cross-accented ASR
Language:Python436
andi611/Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
Language:Python5211
Wendison/VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Language:Jupyter Notebook33455
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language:Python4.5k361
mrdbourke/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Language:Jupyter Notebook10.3k3k
espnet/espnet
End-to-End Speech Processing Toolkit
Language:Python8.3k2.2k
musikalkemist/pytorchforaudio
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
Language:Python23683
neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Jupyter Notebook12.9k1.8k
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python52.2k8.7k
DigitalPhonetics/IMS-Toucan
Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.
Language:Python1.4k156