/deep_RL-multi-arm-bandit-exploration

An implementation of the reinforcement learning multi-armed bandit experiment, comparing different exploration techniques.
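To make the experiment concrete, here is a minimal sketch of one common exploration technique, epsilon-greedy, on a Gaussian bandit. It is an illustration only: the description above does not say which techniques the repository implements, and the function name `run_epsilon_greedy`, its parameters, and the unit-variance reward model are all assumptions made for this example.

```python
import random


def run_epsilon_greedy(true_means, steps=1000, epsilon=0.1, seed=0):
    """Play a Gaussian bandit with epsilon-greedy action selection.

    Hypothetical helper for illustration; not taken from the repository.
    Returns (estimates, counts): the estimated value and pull count per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # running mean reward per arm
    counts = [0] * n_arms       # number of pulls per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise with unit standard deviation.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the pulled arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("pull counts:", counts)
    print("value estimates:", [round(v, 2) for v in estimates])
```

Other exploration strategies, such as upper confidence bound or softmax action selection, would replace only the arm-selection branch; the reward sampling and incremental mean update stay the same, which is what makes side-by-side comparison straightforward.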

Primary Language: Python
