Multi-Armed-Bandit

Different algorithms to solve the n-armed bandit problem, including:

- Epsilon-greedy
- Softmax
- Reinforcement Comparison
- Thompson Sampling
- UCB1
- UCB2
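
As a rough illustration of two of the listed methods, the sketch below shows how epsilon-greedy and UCB1 might choose an arm on a Bernoulli bandit. The function names, the `run` helper, and the Bernoulli reward model are assumptions made for this example and are not taken from this repository's code.

```python
import math
import random


def epsilon_greedy(counts, values, epsilon, rng):
    """Explore a random arm with probability epsilon, else exploit the best estimate."""
    if rng.random() < epsilon:
        return rng.randrange(len(counts))
    return max(range(len(values)), key=lambda a: values[a])


def ucb1(counts, values):
    """Pick the arm maximising mean + sqrt(2 ln t / n_a); untried arms go first."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    total = sum(counts)
    return max(
        range(len(counts)),
        key=lambda a: values[a] + math.sqrt(2.0 * math.log(total) / counts[a]),
    )


def run(select, probs, steps, seed=0):
    """Play a Bernoulli bandit (hypothetical helper); return (pull counts, estimated means).

    `select` is called as select(counts, values, rng) and must return an arm index.
    """
    rng = random.Random(seed)
    counts = [0] * len(probs)
    values = [0.0] * len(probs)
    for _ in range(steps):
        arm = select(counts, values, rng)
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the sample mean for the chosen arm.
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values


if __name__ == "__main__":
    probs = [0.2, 0.5, 0.8]  # assumed success probability of each arm
    print(run(lambda c, v, r: epsilon_greedy(c, v, 0.1, r), probs, 1000))
    print(run(lambda c, v, r: ucb1(c, v), probs, 1000))
```

Both selectors share the same running-mean bookkeeping, so swapping one strategy for another only changes the `select` callable passed to `run`.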