akjayant/Coding_Reinforcement_Learning

Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)

Jupyter NotebookMIT

Watchers

akjayant
Indian Institute of Science
jhcloos