/Multi-armed-Bandits

In this notebook several classes of multi-armed bandits are implemented. This includes epsilon greedy, UCB, Linear UCB (Contextual bandits) and Kernel UCB. Some of the well cited papers in this context are also implemented.

Primary LanguageJupyter NotebookMIT LicenseMIT

Watchers