This refers to the k-armed bandit problem and the RL algorithms used to solve it.
Primary LanguagePythonMIT LicenseMIT