Pinned Repositories
antispoofing-features
Code for the paper "Bag of features for voice anti-spoofing"
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
agonzalezd's Repositories
agonzalezd/antispoofing-features
Code for the paper "Bag of features for voice anti-spoofing"
agonzalezd/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
agonzalezd/Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
agonzalezd/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
agonzalezd/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
agonzalezd/gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
agonzalezd/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
agonzalezd/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
agonzalezd/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
agonzalezd/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
agonzalezd/TTS-Style-Transfer
Official Pytorch implementation of TTS Style Transfer