feddybear's Stars
Yeongtae/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Yeongtae/waveglow
A Flow-based Generative Network for Speech Synthesis
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
aolney/manual-subtitle-speech-alignment
Postprocess SRT derived speech alignments for creating clean datasets for machine learning
ozdefir/finetuneas
An HTML interface for finetuning the sync map output from aeneas
jpuigcerver/kaldi-decoders
Custom decoders for Kaldi
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
CSTR-Edinburgh/ophelia
Sequence-to-sequence TTS based on Kyubyong's dc_tts
chrislgarry/HarkVisualizer
A web app written with the Tornado framework for speech detection and localization in 8-channel flac/wav audio. Try it out with the test.wav file.