Pinned Repositories
active-learning
attn2d_fork
engagement_detection
automatic engagement detection in a group of children
face_emotion_recognition_keras_tf
This repository based on https://github.com/oarriaga/face_classification.
Keras_highways
Keras-based Highway layers
keras_sgan_ser
This is a project of speech emotion recognition using KERAS based Semi-Generative Adversarial Networks.
LIVE_SER
Live demo for speech emotion recognition using Keras and Tensorflow models
SER_FEAT_EXT
This repository includes source codes and documents explaining feature extraction of speech emotion recognition.
SER_KERAS_TF_TRAINER
This repository includes source codes and documents for Keras/Tensorflow based speech emotion recognition model training.
WaveRNN
Pytorch implementation of Deepmind's WaveRNN model
batikim09's Repositories
batikim09/LIVE_SER
Live demo for speech emotion recognition using Keras and Tensorflow models
batikim09/BVAE-TTS
Official implementation of BVAE-TTS
batikim09/DeepLearningExamples
Deep Learning Examples
batikim09/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
batikim09/EATS
A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech
batikim09/espnet
End-to-End Speech Processing Toolkit
batikim09/FastSpeech
The Implementation of FastSpeech based on pytorch.
batikim09/FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
batikim09/flowtron
Auto-regressive flow-based generative network for text to speech synthesis
batikim09/generative_models
Pytorch implementations of generative models: VQVAE2, AIR, DRAW, InfoGAN, DCGAN, SSVAE
batikim09/Glow_TTS
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
batikim09/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
batikim09/INTERSPEECH19_TUTORIAL
Interspeech 2019 tutorial materials
batikim09/interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
batikim09/LPCNet
Efficient neural speech synthesis
batikim09/melgan
Unofficial PyTorch implementation of MelGAN vocoder (Training in progress)
batikim09/Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
batikim09/ParallelWaveGAN
Unofficial Parallel WaveGAN implementation with Pytorch
batikim09/PHASEN-PyTorch
batikim09/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
batikim09/reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
batikim09/survae_flows
Code for paper "SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows"
batikim09/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
batikim09/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
batikim09/vae-audio
Variational auto-encoders for audio
batikim09/VAE-CVAE-MNIST
Variational Autoencoder and Conditional Variational Autoencoder on MNIST in PyTorch
batikim09/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
batikim09/Waveglow_Inference_in_CUDA
C++ Code to run waveglow inference in cuda
batikim09/WaveNODE
Pytorch Implementation of WaveNODE
batikim09/WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)