Foundations-of-Intelligent-and-Learning-Agent

Assignment solutions CS747

Assignment1 : Multi-armed Bandit that yields Bernoulli rewards. Implemented Thompson sampling algorithm