akjayant/Coding_Reinforcement_Learning
Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)
Jupyter NotebookMIT
Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)
Jupyter NotebookMIT