till2/RL-Bandit

Jupyter Notebook

Readme
0Issues
0Stargazers
1Watcher

RL-Bandit

Implementation of the Upper Confidence Bound Section from "Introduction to RL" by Sutton & Barto. Just playing around and trying different approaches.

Share to

Contact site admin: Geeks.