keshavvinayak01/Practical-RL-coursera

Jupyter Notebook

Reinforcement Learning algorithms

My Implementations of the Reinforcement Learning algorithms from the Practical_Rl course from Coursera using TensorFlow

In the next few commits, i'll add implementations in PyTorch as well.

The following algorithms have been implemented in these notebooks:

Week 1:

Introduction to TensorFlow
Introduction to Gym
Cross-entropy method
Deep cross entropy method

Week 2:

Markov Decision Process

Week 3:

Q-Learning
Expected value SARSA
Q-Learning using experience replay

Week 4:

Approximate Q-Learning
DQN on Atari: Breakout.

Week 5:

REINFORCE
Asynchronous Actor Critic method (A3C)

Week 6:

Multi Arm bandits (including different approximation methods)
Monte Carlo Tree Search

Research Papers: