feddybear

Nothing much to see here.

Tokyo, Japan

feddybear's Stars

Yeongtae/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook304
Yeongtae/waveglow
A Flow-based Generative Network for Speech Synthesis
Language:Python61
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Language:Python2.3k905
aolney/manual-subtitle-speech-alignment
Postprocess SRT derived speech alignments for creating clean datasets for machine learning
Language:F#172
ozdefir/finetuneas
An HTML interface for finetuning the sync map output from aeneas
Language:JavaScript5325
jpuigcerver/kaldi-decoders
Custom decoders for Kaldi
Language:C++8028
NVIDIA/mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
Language:Jupyter Notebook855183
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Language:Python33950
CSTR-Edinburgh/ophelia
Sequence-to-sequence TTS based on Kyubyong's dc_tts
Language:Python6021
chrislgarry/HarkVisualizer
A web app written with the Tornado framework for speech detection and localization in 8-channel flac/wav audio. Try it out with the test.wav file.
Language:JavaScript2010