amitkml/Reinforcement_Learning

Reinforcement learning tutorials

Python

Reinforcement Learning Tutorials:

PPO and PPO_CNN agents playing the Pong-v0 game:

Deep Q Learning tutorial (DQN)
Double Deep Q Learning tutorial (DDQN)
Dueling Double Deep Q Learning tutorial (D3QN)
Epsilon Greedy Dueling Double Deep Q Learning tutorial (D3QN)
Prioritized Experience Replay (PER) D3QN tutorial
D3QN PER with Convolutional Neural Networks tutorial
A.I. learns to play Pong with DQN
Introduction to RL Policy Gradient (PG or REINFORCE)
Introduction to RL Advantage Actor Critic algorithm (A2C)
Introduction to RL Asynchronous Advantage Actor Critic algorithm (A3C)
Introduction to RL Proximal Policy Optimization algorithm (PPO)
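
As a quick orientation for the DQN-family tutorials listed above, here is a minimal sketch of epsilon-greedy action selection and of how the bootstrap target differs between DQN and Double DQN. It is not taken from the tutorials: the function names, argument names, and the default `gamma` are assumptions chosen for illustration, and plain Python lists stand in for network outputs.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Return a random action with probability epsilon, else the greedy action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def dqn_target(reward, done, q_target_next, gamma=0.99):
    """DQN target: the target network both selects and evaluates the next action."""
    if done:
        return reward  # no bootstrapping from a terminal state
    return reward + gamma * max(q_target_next)


def ddqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return reward + gamma * q_target_next[best_action]


if __name__ == "__main__":
    # Hypothetical Q-values for the next state (two actions).
    q_online_next = [3.0, 1.0]
    q_target_next = [2.0, 5.0]
    print(dqn_target(1.0, False, q_target_next, gamma=0.5))
    print(ddqn_target(1.0, False, q_online_next, q_target_next, gamma=0.5))
```

Decoupling action selection from evaluation is what lets Double DQN reduce the overestimation that a plain max over the target network's Q-values tends to produce.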

PPO Pong-v0 Learning curve:
