Pinned Repositories
AMTV
Some results from research project "Acoustic modeling and transformation of varieties for speech synthesis"
Audio-Games
German audio-only games using TTS
ecg-loss
e-contaminated Gausssian distribution loss for Keras with Tensorflow backend
NeMo
NeMo: a toolkit for conversational AI
osue_exercise1
Microtrans Games Inc. presents Ryskim
osue_exercise2
Teaching material Markov Models
SALB
Frontend system for HMM-based speech synthesis models generated by HTS.
tacorn
2018/2019 TTS framework integrating state of the art open source methods
Tacotron-WaveRNN
TTS (Tacotron + WaveRNN)
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
m-toman's Repositories
m-toman/tacorn
2018/2019 TTS framework integrating state of the art open source methods
m-toman/SALB
Frontend system for HMM-based speech synthesis models generated by HTS.
m-toman/Audio-Games
German audio-only games using TTS
m-toman/AMTV
Some results from research project "Acoustic modeling and transformation of varieties for speech synthesis"
m-toman/ecg-loss
e-contaminated Gausssian distribution loss for Keras with Tensorflow backend
m-toman/melgan
Unofficial PyTorch implementation of MelGAN vocoder (Training in progress)
m-toman/NeMo
NeMo: a toolkit for conversational AI
m-toman/Tacotron-WaveRNN
TTS (Tacotron + WaveRNN)
m-toman/TTS
Deep learning for Text to Speech
m-toman/waveglow
A Flow-based Generative Network for Speech Synthesis
m-toman/osue_exercise1
Microtrans Games Inc. presents Ryskim
m-toman/osue_exercise2
Teaching material Markov Models
m-toman/twelvedaysofxmas
Twelve days of XMas in twelve languages
m-toman/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
m-toman/langgraph
Build resilient language agents as graphs.
m-toman/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
m-toman/merlin
This is now the official location of the Merlin project.
m-toman/mimic
Mycroft's TTS engine, based on CMU's Flite (Festival Lite)
m-toman/notebooks
m-toman/nv-wavenet
Reference implementation of real-time autoregressive wavenet inference
m-toman/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
m-toman/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
m-toman/UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
m-toman/vim-config
vim linux settings
m-toman/vim-win-config
gvim windows settings
m-toman/wavenet_vocoder
WaveNet vocoder
m-toman/WaveRNN
Pytorch implementation of Deepmind's WaveRNN model
m-toman/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)