/Nonstationary-k-arm-Bandits

Python code for a basic reinforcement learning (RL) solution to the non-stationary k-armed bandit problem, in which the true action values change over time. Based on the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew G. Barto.
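
As a rough illustration of the problem described above, here is a minimal sketch of an epsilon-greedy agent that tracks drifting action values with a constant step size. It is not taken from this repository: the function name `run_nonstationary_bandit`, its parameters, and the default values (10 arms, random-walk drift with standard deviation 0.01, `epsilon=0.1`, `alpha=0.1`) are assumptions chosen for illustration, loosely following the kind of experiment the book suggests for non-stationary bandits.

```python
import random


def run_nonstationary_bandit(k=10, steps=10000, epsilon=0.1, alpha=0.1,
                             walk_std=0.01, seed=0):
    """Hypothetical helper: epsilon-greedy agent on a drifting k-armed bandit.

    Returns (estimates, average reward, fraction of optimal actions).
    """
    rng = random.Random(seed)
    q_true = [0.0] * k  # true action values, all equal at the start (assumption)
    q_est = [0.0] * k   # the agent's estimates
    total_reward = 0.0
    optimal_picks = 0

    for _ in range(steps):
        # Explore with probability epsilon, otherwise act greedily,
        # breaking ties between equally good estimates at random.
        if rng.random() < epsilon:
            action = rng.randrange(k)
        else:
            best = max(q_est)
            action = rng.choice([a for a in range(k) if q_est[a] == best])

        if q_true[action] == max(q_true):
            optimal_picks += 1

        # Reward is the true value plus unit-variance Gaussian noise.
        reward = rng.gauss(q_true[action], 1.0)
        total_reward += reward

        # Constant step size: recent rewards weigh more than old ones,
        # which lets the estimate follow a moving target.
        q_est[action] += alpha * (reward - q_est[action])

        # Non-stationarity: every true value takes a small random-walk step.
        for a in range(k):
            q_true[a] += rng.gauss(0.0, walk_std)

    return q_est, total_reward / steps, optimal_picks / steps


if __name__ == "__main__":
    estimates, avg_reward, optimal_fraction = run_nonstationary_bandit()
    print("average reward:", avg_reward)
    print("fraction of optimal actions:", optimal_fraction)
```

The constant step size `alpha` is the key design choice here: a sample-average update (step size 1/n) weights all past rewards equally, so it adapts more and more slowly as the true values drift, whereas a constant step size keeps discounting older rewards.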

Primary Language: Python
