/bandits

Algorithms for multiarmed bandits such as e-greedy, thompson sampling, etc.

Primary LanguageJupyter Notebook

Multiarmed bandits

I introduced various multiarmed bandits algorithms such as e-greedy, annealing epsilon greedy, thompson sampling, UCB etc. I also compared the performance of these algorithms and how they can quickly find the best arm.