/Reinforcement-Learning

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Build Status master contributions welcome License

Reinforcement Learning

TODO

  • Decision Making Under Uncertainty
  • Markov Decision Processes
  • Values Functions & Bellman Equations
  • Dynamic Programming
  • Monte Carlo Methods
  • Temporal Difference Methods
  • Planning, Learning & Acting
  • On-Policy Prediction with Approximation
  • Constructing Features for Prediction
  • Control with Approximation
  • Policy Gradient Methods
    • REINFORCE
    • Actor-Critic
    • Off-Policy Policy Gradient
    • A2C
    • A3C
    • DDPG
    • PPO
    • SAC
    • TD3