Pinned Repositories
Akella17.github.io
Beta-VAE
To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.
Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
EnhanceNet
Achieves realistic textures by using automated texture synthesis in combination with a perceptual loss rather than focusing on optimizing for a pixel accurate reproduction of ground truth images during training. By using feed-forward fully convolutional neural networks in an adversarial training setting, this approach achieves a significant boost in image quality at high magnification ratios.
Handwriting_Synthesis
This work attempts to generate sequences of handwritten sentences using LSTM network and Mixture Model (Based on the work : https://arxiv.org/pdf/1308.0850.pdf by Alex Graves)
Language_Identification
This is an implementation of a character-level LSTM network for language identification. Inspired from Stanford Language Identification Engine(SLIDE) : https://arxiv.org/abs/1701.03682
Open3D
Open3D: A Modern Library for 3D Data Processing
SeqQLearning
speaker-embedding
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
Voice_Style_Transfer
Attempts to perform voice transfer, inspired by Gatys et al.'s work in image domain. Uses two pre-trained networks (Wavenet and Speaker Recognition) for perceptual style and context losses.
Akella17's Repositories
Akella17/Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Akella17/speaker-embedding
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
Akella17/Beta-VAE
To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.
Akella17/EnhanceNet
Achieves realistic textures by using automated texture synthesis in combination with a perceptual loss rather than focusing on optimizing for a pixel accurate reproduction of ground truth images during training. By using feed-forward fully convolutional neural networks in an adversarial training setting, this approach achieves a significant boost in image quality at high magnification ratios.
Akella17/Voice_Style_Transfer
Attempts to perform voice transfer, inspired by Gatys et al.'s work in image domain. Uses two pre-trained networks (Wavenet and Speaker Recognition) for perceptual style and context losses.
Akella17/Akella17.github.io
Akella17/Handwriting_Synthesis
This work attempts to generate sequences of handwritten sentences using LSTM network and Mixture Model (Based on the work : https://arxiv.org/pdf/1308.0850.pdf by Alex Graves)
Akella17/Language_Identification
This is an implementation of a character-level LSTM network for language identification. Inspired from Stanford Language Identification Engine(SLIDE) : https://arxiv.org/abs/1701.03682
Akella17/Open3D
Open3D: A Modern Library for 3D Data Processing
Akella17/SeqQLearning
Akella17/Speaker-Recognition
TensorFlow implementation of a 3 Layer Stacked LSTM architecture to classify speakers in VCC (2016) dataset.