/reinforcement-learning

Personal experiments on Reinforcement Learning

Primary LanguageJupyter Notebook

Reinforcement Learning

Realizations
  • Old experiments on RL (2016)
  • Solving OpenAI Gym environments (2017-2018)
  • Developing an multi agent Tic Tac Toe environment and solving it with Policy Gradients (May 2017)
  • Using RL to automatically adapt the cooling in a Data Center (August 2017)
  • Controlling Robots via Reinforcement Learning (November 2017)
  • Playing and solving the Chrome Dinosaur Game with Evolution Strategies and PyTorch (January 2018)
  • Delivery optimization using Reinforcement Learning

References and inspiration

RL references
Q Learning references
Deep Q Learning
Policy Gradient
Evolution strategies
Actor Critic, A2C, ACKTR
PPO, TRPO
AlphaGo
Monte Carlo Tree Search
Misc
Environment

Papers