A simple Q-Learning Algorithm which works on some gym environments.
Primary LanguagePythonMIT LicenseMIT