/MountainCar_ActorCritic

TD Advantage Actor-Critic RL algorithm

Primary LanguageJupyter Notebook

MountainCar_ActorCritic

My simple implementation of actor-critic RL in Tensorflow.

See the blog at https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mountain-car-in-tensorflow-4c1fb2110f7c for description of this code.

For the theory of this actor-critic algo see my blog at https://medium.com/@asteinbach/rl-introduction-simple-actor-critic-for-continuous-actions-4e22afb712

This code originally inspired by Denny Britz' implementation here: https://github.com/dennybritz/reinforcement-learning/blob/master/PolicyGradient/Continuous%20MountainCar%20Actor%20Critic%20Solution.ipynb