Algorithms for multiarmed bandits such as e-greedy, thompson sampling, etc.
Jupyter Notebook
Multiarmed bandits
I introduced various multiarmed bandits algorithms such as e-greedy, annealing epsilon greedy, thompson sampling, UCB etc. I also compared the performance of these algorithms and how they can quickly find the best arm.