AlexK-PL

Pinned Repositories

AlexK-PL.github.io
0 1 00
AlexMIIS.github.io
0 1 00
Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
Language:Shell0 0 00
GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language:Python1 1 00
GST_Tacotron2
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJSpeech Dataset.
Language:Python9 1 05
GST_Tacotron2_PitchContourReference
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. Instead of using the whole mel-scale spectrogram representation in the GST input, we extracted and used only the pitch contour in a sparse representation. The model has been trained with the English read-speech LJSpeech Dataset.
Language:Python7 1 04
Neural_TTS_Tacotron2_pytorch
A pytorch implementation of a Text-to-Speech system based on NVIDIA's Tacotron2 text2mel plus a neural vocoder
Language:Jupyter Notebook2 1 00
tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook1 1 00
Tacotron2-1
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Language:Python1 0 00
Tacotron2_GST_SPM
This is our work on learning speaking style in speech synthesis but only using the pitch frequency sub-band as a speaker reference. We trained a modified version of the NVIDIA's Tacotron2 model but including Global Style Tokens (GST).
Language:Python2 1 00

AlexK-PL's Repositories

AlexK-PL/GST_Tacotron2
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJSpeech Dataset.
Language:Python9 1 05
AlexK-PL/GST_Tacotron2_PitchContourReference
A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. Instead of using the whole mel-scale spectrogram representation in the GST input, we extracted and used only the pitch contour in a sparse representation. The model has been trained with the English read-speech LJSpeech Dataset.
Language:Python7 1 04
AlexK-PL/Neural_TTS_Tacotron2_pytorch
A pytorch implementation of a Text-to-Speech system based on NVIDIA's Tacotron2 text2mel plus a neural vocoder
Language:Jupyter Notebook2 1 00
AlexK-PL/Tacotron2_GST_SPM
This is our work on learning speaking style in speech synthesis but only using the pitch frequency sub-band as a speaker reference. We trained a modified version of the NVIDIA's Tacotron2 model but including Global Style Tokens (GST).
Language:Python2 1 00
AlexK-PL/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language:Python1 1 00
AlexK-PL/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook1 1 00
AlexK-PL/Tacotron2-1
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Language:Python1 0 00
AlexK-PL/AlexK-PL.github.io
0 1 00
AlexK-PL/AlexMIIS.github.io
0 1 00
AlexK-PL/Blizzard2013_Segmentation
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
Language:Shell0 0 00
AlexK-PL/git_course_python
AlexK-PL/Hierarchical-Neural-Autoencoder-1
Language:Python1 0
AlexK-PL/lafrescat-audio-demo
Language:HTML
AlexK-PL/marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Language:Java
AlexK-PL/MaryTTS-Paragraph_patterns
This is an extension of the MaryTTS 5.3 SNAPSHOT version. This includes rule-based and internal post-processing implementations to include paragraph feature patterns.
1 0
AlexK-PL/ProsodyModifier
A modifier tool for the open-source MaryTTS platform to insert prosody information coming either from a recording or a statistical model
Language:Java1 0
AlexK-PL/punkProse
Punctuation generation for speech transcripts using lexical and prosodic features
Language:Python1 0
AlexK-PL/Spoon-Knife
This repo is for demonstration purposes only.
Language:HTML1 0
AlexK-PL/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
AlexK-PL/waveglow
A Flow-based Generative Network for Speech Synthesis
Language:Python1 0