A Q-learning solver applied to the gym environment Taxi-v2.
Primary LanguageJupyter NotebookMIT LicenseMIT