Memory Bandits

Implementation of the Global Switching and Per-Arm Switching Thompson Sampling algorithms for the non-stationary stochastic multi-armed bandit problem.
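
For orientation, the sketch below shows plain Beta-Bernoulli Thompson Sampling on a stationary bandit, which is the starting point that switching variants extend. It is not this repository's implementation and does no change-point handling. The function name `thompson_sampling`, the arm probabilities, the horizon, and the seed are all illustrative assumptions.

```python
import random


def thompson_sampling(reward_probs, horizon, seed=0):
    """Beta-Bernoulli Thompson Sampling on a stationary bandit.

    reward_probs: hypothetical true success probability of each arm.
    Returns the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    n_arms = len(reward_probs)
    # Beta(1, 1) prior per arm: alpha tracks successes, beta tracks failures.
    alpha = [1] * n_arms
    beta = [1] * n_arms
    pulls = [0] * n_arms

    for _ in range(horizon):
        # Draw one plausible mean per arm from its current posterior.
        samples = [rng.betavariate(alpha[i], beta[i]) for i in range(n_arms)]
        # Play the arm whose sampled mean is largest.
        arm = max(range(n_arms), key=lambda i: samples[i])
        # Observe a Bernoulli reward and update only the played arm.
        reward = 1 if rng.random() < reward_probs[arm] else 0
        alpha[arm] += reward
        beta[arm] += 1 - reward
        pulls[arm] += 1

    return pulls


# Illustrative run with three arms; the last arm is the best one.
pulls = thompson_sampling([0.2, 0.5, 0.8], horizon=2000)
```

Because the posterior here accumulates every observation since the start, it adapts slowly if the arm probabilities change over time, which is the situation the non-stationary setting addresses.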