Akella17

ML Scientist @ Tesla AutoPilot, MS Robotics @ CMU, B.tech ECE @ IIT Roorkee

Palo Alto, CA

Pinned Repositories

Akella17.github.io
Language:HTML0 0 00
Beta-VAE
To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.
Language:Jupyter Notebook6 3 10
Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Language:Python16 4 17
EnhanceNet
Achieves realistic textures by using automated texture synthesis in combination with a perceptual loss rather than focusing on optimizing for a pixel accurate reproduction of ground truth images during training. By using feed-forward fully convolutional neural networks in an adversarial training setting, this approach achieves a significant boost in image quality at high magnification ratios.
Language:Jupyter Notebook5 3 01
Handwriting_Synthesis
This work attempts to generate sequences of handwritten sentences using LSTM network and Mixture Model (Based on the work : https://arxiv.org/pdf/1308.0850.pdf by Alex Graves)
Language:Jupyter Notebook0 3 01
Language_Identification
This is an implementation of a character-level LSTM network for language identification. Inspired from Stanford Language Identification Engine(SLIDE) : https://arxiv.org/abs/1701.03682
Language:Jupyter Notebook0 2 00
Open3D
Open3D: A Modern Library for 3D Data Processing
Language:C++0 2 00
SeqQLearning
Language:Python00
speaker-embedding
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
Language:Python10 4 05
Voice_Style_Transfer
Attempts to perform voice transfer, inspired by Gatys et al.'s work in image domain. Uses two pre-trained networks (Wavenet and Speaker Recognition) for perceptual style and context losses.
Language:Jupyter Notebook3 3 02

Akella17's Repositories

Akella17/Deep-Bayesian-Quadrature-Policy-Optimization
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Language:Python16 4 17
Akella17/speaker-embedding
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
Language:Python10 4 05
Akella17/Beta-VAE
To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.
Language:Jupyter Notebook6 3 10
Akella17/EnhanceNet
Achieves realistic textures by using automated texture synthesis in combination with a perceptual loss rather than focusing on optimizing for a pixel accurate reproduction of ground truth images during training. By using feed-forward fully convolutional neural networks in an adversarial training setting, this approach achieves a significant boost in image quality at high magnification ratios.
Language:Jupyter Notebook5 3 01
Akella17/Voice_Style_Transfer
Attempts to perform voice transfer, inspired by Gatys et al.'s work in image domain. Uses two pre-trained networks (Wavenet and Speaker Recognition) for perceptual style and context losses.
Language:Jupyter Notebook3 3 02
Akella17/Akella17.github.io
Language:HTML0 0 00
Akella17/Handwriting_Synthesis
This work attempts to generate sequences of handwritten sentences using LSTM network and Mixture Model (Based on the work : https://arxiv.org/pdf/1308.0850.pdf by Alex Graves)
Language:Jupyter Notebook0 3 01
Akella17/Language_Identification
This is an implementation of a character-level LSTM network for language identification. Inspired from Stanford Language Identification Engine(SLIDE) : https://arxiv.org/abs/1701.03682
Language:Jupyter Notebook0 2 00
Akella17/Open3D
Open3D: A Modern Library for 3D Data Processing
Language:C++0 2 00
Akella17/SeqQLearning
Language:Python00
Akella17/Speaker-Recognition
TensorFlow implementation of a 3 Layer Stacked LSTM architecture to classify speakers in VCC (2016) dataset.
Language:Jupyter Notebook0 2 01

Akella17

Pinned Repositories

Akella17.github.io

Beta-VAE

Deep-Bayesian-Quadrature-Policy-Optimization

EnhanceNet

Handwriting_Synthesis

Language_Identification

Open3D

SeqQLearning

speaker-embedding

Voice_Style_Transfer

Akella17's Repositories

Akella17/Deep-Bayesian-Quadrature-Policy-Optimization

Akella17/speaker-embedding

Akella17/Beta-VAE

Akella17/EnhanceNet

Akella17/Voice_Style_Transfer

Akella17/Akella17.github.io

Akella17/Handwriting_Synthesis

Akella17/Language_Identification

Akella17/Open3D

Akella17/SeqQLearning

Akella17/Speaker-Recognition