/MonteCarlo

Implementation of first visit Monte Carlo for prediction and control

Primary LanguageJupyter Notebook

Monte Carlo Methods

This repo shows how to implement first visit monte carlo for both prediction and control using the blackjack OpenAI gym environment. This implementation is based off the algorithms describe in Reinforcement Learning: An Introduction by Sutton and Barto, and the following repositories

You can read the full explanation of the algorithms in the accompanying medium article found here. https://towardsdatascience.com/learning-to-win-blackjack-with-monte-carlo-methods-61c90a52d53e