ivanvovk

• Mathematics • Machine Learning • Computer Vision • Speech technologies

Higher School of Economics, Skoltech

Pinned Repositories

Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook539 23 28111
compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
Language:Jupyter Notebook22 1 09
controllable-face-generation
Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)
Language:Jupyter Notebook19 3 34
dmdx-jax
Language:Jupyter Notebook0 1 00
durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Language:Python181 8 1048
stochastic-calculus
MCMC and another stochastic calculus stuff
Language:Jupyter Notebook0 1 01
WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook398 17 2654

ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook398 17 2654
ivanvovk/durian-pytorch
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Language:Python181 8 1048
ivanvovk/compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
Language:Jupyter Notebook22 1 09
ivanvovk/controllable-face-generation
Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)
Language:Jupyter Notebook19 3 34
ivanvovk/dmdx-jax
Language:Jupyter Notebook0 1 00
ivanvovk/stochastic-calculus
MCMC and another stochastic calculus stuff
Language:Jupyter Notebook0 1 01