Reinforcement Learning

Overview

This is an repository of reinforcement learning that I'm currently working on.

Dueling architecture Double Q Learning with prioritized experience replay after training 4,500 episodes.

Lunar lander continuous environment by using Deep Deterministic Policy Gradient. (Fail)

Deep Deterministic Policy Gradient (DDPG) in Pendulum environment and moving average reward

Reinforcement Learning An Introduction, Richard S. Sutton and Andrew G. Barto
Grokking Deep Reinforcement Learning, Miguel Morales
Coursera Reinforcement Learning Specialization by University of Alberta (https://www.coursera.org/specializations/reinforcement-learning)
Udacity Deep Reinforcement Learning Nanodegree (https://www.udacity.com/course/deep-reinforcement-learning-nanodegree--nd893)
Reading papers from OpenAI Spinning Up key papers (https://spinningup.openai.com/en/latest/spinningup/keypapers.html)