Using a range of methods to approach the k-armed bandit problem, inspired by Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
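As a rough illustration of one such method, here is a minimal sketch of an epsilon-greedy agent with sample-average value estimates, in the spirit of the book's treatment of the problem. It is not taken from this repository's notebooks: the function name `run_epsilon_greedy`, its parameters, and the unit-variance Gaussian rewards are all assumptions made for the example.

```python
import random


def run_epsilon_greedy(true_means, steps=1000, epsilon=0.1, seed=0):
    """Run a hypothetical epsilon-greedy agent on a k-armed Gaussian bandit.

    true_means: assumed mean reward of each arm (unknown to the agent).
    Returns the agent's value estimates, pull counts, and total reward.
    """
    rng = random.Random(seed)
    k = len(true_means)
    estimates = [0.0] * k  # current estimate of each arm's value
    counts = [0] * k       # number of times each arm has been pulled
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(k)
        else:
            # Exploit: pick a greedy arm, breaking ties at random.
            best = max(estimates)
            arm = rng.choice([a for a in range(k) if estimates[a] == best])

        # Assumption: rewards are Gaussian with unit variance around the arm's mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental sample-average update: Q <- Q + (R - Q) / N.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


estimates, counts, total_reward = run_epsilon_greedy([0.2, -0.5, 1.0, 0.0])
print("estimates:", [round(q, 2) for q in estimates])
print("pull counts:", counts)
```

With a small `epsilon` the agent spends most steps on whichever arm currently looks best, while the occasional random pull keeps the other estimates from going stale. Other approaches to the same problem change how the arm is chosen or how the estimates are updated.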
Primary language: Jupyter Notebook. License: MIT.