A collection of basic reinforcement learning agents trained in various environments, starting with the simplest scenario, the multi-armed bandit.
mdumke/reinforcement-learning-experiments
Training basic reinforcement learning agents in various environments
Jupyter Notebook
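
To make the multi-armed bandit starting point concrete, here is a minimal sketch of an epsilon-greedy agent on a Gaussian bandit. It is not taken from the repository's notebooks: the function name `run_epsilon_greedy`, its parameters, the arm means, and the unit-variance reward model are all assumptions chosen for illustration.

```python
import random


def run_epsilon_greedy(true_means, steps=2000, epsilon=0.1, seed=0):
    """Play a Gaussian multi-armed bandit with an epsilon-greedy agent.

    Hypothetical helper for illustration; not part of the repository.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise with unit standard deviation.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the pulled arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


estimates, counts, total_reward = run_epsilon_greedy([0.1, 0.5, 0.9])
print("estimated means:", [round(e, 2) for e in estimates])
print("pull counts:", counts)
```

The incremental mean update avoids storing the full reward history, and the fixed `seed` keeps runs reproducible when comparing different values of `epsilon`.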