Q-learning and SARSA algorithms from Sutton's Reinforcement Learning book.
Primary LanguagePythonMIT LicenseMIT