abhishm

California

Pinned Repositories

abhishm.github.io
Build a Jekyll blog in minutes, without touching the command line.
Language:SCSS0 2 00
actor_critic
This repository explains the importance of incorporating entropy in policy gradient algorithms.
Language:Python0 2 00
actor_critic_dqn
Language:Jupyter Notebook0 2 01
beyond_dqn
You can find the blog related to this repository here
Language:Python1 2 00
cs61a
Language:Python2 2 12
dqn
A short tutorial on the tips and tricks used in implementing Deep Q-Networks that helped create a first AI capable of successfully playing Atari games.
Language:Python4 2 00
pg_rnn
There are a few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient algorithms. This repository explains them and provides a solution for them. Please see the blog for more details.
Language:Python18 2 32
PGQ
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
Language:Python15 6 03
policy_gradient_learning_exercises
Language:Jupyter Notebook1 2 00
vae_class_imbalance
You can find the blog related to this repository here
Language:Python6 3 01

abhishm's Repositories

abhishm/pg_rnn
There are a few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient algorithms. This repository explains them and provides a solution for them. Please see the blog for more details.
Language:Python18 2 32
abhishm/PGQ
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
Language:Python15 6 03
abhishm/vae_class_imbalance
You can find the blog related to this repository here
Language:Python6 3 01
abhishm/dqn
A short tutorial on the tips and tricks used in implementing Deep Q-Networks that helped create a first AI capable of successfully playing Atari games.
Language:Python4 2 00
abhishm/cs61a
Language:Python2 2 12
abhishm/beyond_dqn
You can find the blog related to this repository here
Language:Python1 2 00
abhishm/policy_gradient_learning_exercises
Language:Jupyter Notebook1 2 00
abhishm/abhishm.github.io
Build a Jekyll blog in minutes, without touching the command line.
Language:SCSS0 2 00
abhishm/actor_critic
This repository explains the importance of incorporating entropy in policy gradient algorithms.
Language:Python0 2 00
abhishm/batch_normalization
Language:Jupyter Notebook2 0
abhishm/blog
2 0
abhishm/competitive-data-science
Materials for "How to Win a Data Science Competition: Learn from Top Kagglers" course
Language:Jupyter Notebook2 0
abhishm/coursera_predict_future_sales
Language:Jupyter Notebook2 01
abhishm/dark_knowledge
Learn how to make a smaller network as good as a large ensemble model, which can accelerate inference.
Language:Python3 1
abhishm/dpg
Language:Python2 0
abhishm/fsdl-text-recognizer-project
Full Stack Deep Learning Bootcamp Project (Public)
Language:Jupyter Notebook2 0
abhishm/gan
Language:Jupyter Notebook2 0
abhishm/gVAE
Geometric Variational Auto Encoder
2 0
abhishm/interactive-coding-challenges
Huge update! Interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Language:Python2 0
abhishm/karpathy_class
Language:Jupyter Notebook
abhishm/keras
Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on Theano or TensorFlow.
Language:Python2 0
abhishm/keras_learning
Language:Python2 0
abhishm/language_modeling
Language:Jupyter Notebook
abhishm/PCL
PCL is a powerful algorithm for handling off-policy data. Please find more details here.
Language:Python3 0
abhishm/pg
Language:Python2 0
abhishm/pg_rnn_baseline
Language:Python2 0
abhishm/reinforcementLearning
Language:Jupyter Notebook2 0
abhishm/RNN_from_learning_to_learn
Language:Python2 0
abhishm/varying_length_input
Language:Jupyter Notebook1
abhishm/yummy
Solving the Kaggle problem for predicting cuisines based on ingredients
Language:Jupyter Notebook2 0