My repository for model-free reinforcement learning algorithms implemented in several environements from the OpenAI gym. The codes are based on pseudocodes presented in Sutton & Barto's book.
Implementation of model-free off-policy Q-learning algorithm in the cart pole and mountain car environments.
Implementation of model-free on-policy SARSA algorithm in the cart pole, mountain car and acrobot environments.