akjayant/Coding_Reinforcement_Learning

Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)

Jupyter NotebookMIT

Coding the RL elements

Implementation of basic RL steps and algorithms with my personal snippets/notes in jupyter notebook.

References -

Intro to RL - Sutton & Barto
Denny Britz RL Repo - blackjack.py, gridworld.py, plotting.py