/ReinforcementLearning-Navigation

Agent trained with the Deep Q-Network (DQN) algorithm to navigate an environment and pick up yellow bananas while avoiding blue bananas

Primary LanguagePythonMIT LicenseMIT

Watchers