Multi-armed bandit Here we implement the epsilon-greedy strategy for solving the multi-armed bandit problem.