/upper-confidence-bound-and-epsilon-greedy-for-Bandit

In this project we solve bandit classic problem for 3 bandit machine that generate rewards with gaussian distribution with upper confidence bound and epsilon greedy method.

Primary LanguageJupyter Notebook

upper-confidence-bound-and-epsilon-greedy-for-Bandit

In this project we solve bandit classic problem for 3 bandit machine that generate rewards with gaussian distribution with upper confidence bound and epsilon greedy method.you can check out the solutoins in file bandit_UCB_Egreedy.ipynb