Implementation of a variety of bandit algorithms, a paradigm in reinforcement learning
Primary LanguagePythonBSD 3-Clause "New" or "Revised" LicenseBSD-3-Clause