Pinned Repositories
abhishm.github.io
Build a Jekyll blog in minutes, without touching the command line.
actor_critic
This repository explains the importance of incorporating entropy in policy gradient algorithms.
actor_critic_dqn
beyond_dqn
You can find the blog related to this repository here
cs61a
dqn
A short tutorial about tips and tricks used in implementation of Deep Q-networks that helped in creating a first AI that can successfully play Atari.
pg_rnn
There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog for more details.
PGQ
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
policy_gradient_learning_exercises
vae_class_imbalance
You can find the blog related to this repository here
abhishm's Repositories
abhishm/pg_rnn
There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog for more details.
abhishm/PGQ
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
abhishm/vae_class_imbalance
You can find the blog related to this repository here
abhishm/dqn
A short tutorial about tips and tricks used in implementation of Deep Q-networks that helped in creating a first AI that can successfully play Atari.
abhishm/cs61a
abhishm/beyond_dqn
You can find the blog related to this repository here
abhishm/policy_gradient_learning_exercises
abhishm/abhishm.github.io
Build a Jekyll blog in minutes, without touching the command line.
abhishm/actor_critic
This repository explains the importance of incorporating entropy in policy gradient algorithms.
abhishm/batch_normalization
abhishm/blog
abhishm/competitive-data-science
Materials for "How to Win a Data Science Competition: Learn from Top Kagglers" course
abhishm/coursera_predict_future_sales
abhishm/dark_knowledge
Learn about making a smaller network as good as a big ensemble model that can accelarate inference time.
abhishm/dpg
abhishm/fsdl-text-recognizer-project
Full Stack Deep Learning Bootcamp Project (Public)
abhishm/gan
abhishm/gVAE
Geometric Variational Auto Encoder
abhishm/interactive-coding-challenges
Huge update! Interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
abhishm/karpathy_class
abhishm/keras
Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.
abhishm/keras_learning
abhishm/language_modeling
abhishm/PCL
PCL is a powerful algorithm for handling the off-policy data. Please find more details in here.
abhishm/pg
abhishm/pg_rnn_baseline
abhishm/reinforcementLearning
abhishm/RNN_from_learning_to_learn
abhishm/varying_length_input
abhishm/yummy
Solving the Kaggle problem for predicting cuisines based on ingredients