ivanvovk
• Mathematics • Machine Learning • Computer Vision • Speech technologies
Higher School of Economics, Skoltech
Pinned Repositories
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + WaveGlow.
controllable-face-generation
Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)
dmdx-jax
durian-pytorch
Implementation of the "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
stochastic-calculus
MCMC and other stochastic calculus stuff
WaveGrad
Implementation of the WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
ivanvovk's Repositories
ivanvovk/WaveGrad
Implementation of the WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
ivanvovk/durian-pytorch
Implementation of the "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
ivanvovk/compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + WaveGlow.
ivanvovk/controllable-face-generation
Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)
ivanvovk/dmdx-jax
ivanvovk/stochastic-calculus
MCMC and other stochastic calculus stuff