A multi-agent reinforcement learning environment in which two agents, each controlled by a deep recurrent Q-network (DRQN), play a custom version of the pursuit-evasion game.
Primary language: Python. License: MIT.
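To make the setup concrete, here is a minimal sketch of what a two-agent pursuit-evasion grid world could look like. It is not the repository's implementation: the class name `PursuitEvasionEnv`, the action set, the reward values, the `view_radius` parameter, and the capture rule are all hypothetical choices made for illustration. The limited view is included because partial observability is the usual reason to use a recurrent Q-network; the actual environment may expose different observations.

```python
import random

# Hypothetical action set: stay, up, down, left, right (as grid offsets).
ACTIONS = {0: (0, 0), 1: (0, -1), 2: (0, 1), 3: (-1, 0), 4: (1, 0)}


class PursuitEvasionEnv:
    """Toy two-agent grid world. All rules here are illustrative assumptions."""

    def __init__(self, size=5, max_steps=50, view_radius=1, seed=None):
        self.size = size
        self.max_steps = max_steps
        self.view_radius = view_radius  # how far each agent can see
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        # Place the two agents on distinct random cells.
        cells = [(x, y) for x in range(self.size) for y in range(self.size)]
        self.pursuer, self.evader = self.rng.sample(cells, 2)
        self.steps = 0
        return self._observations()

    def _move(self, pos, action):
        # Apply the offset and clamp to the grid, so walls block movement.
        dx, dy = ACTIONS[action]
        x = min(max(pos[0] + dx, 0), self.size - 1)
        y = min(max(pos[1] + dy, 0), self.size - 1)
        return (x, y)

    def _observe(self, me, other):
        # Partial observability: the opponent is hidden (None) when out of range.
        dist = max(abs(me[0] - other[0]), abs(me[1] - other[1]))
        return (me, other if dist <= self.view_radius else None)

    def _observations(self):
        return {
            "pursuer": self._observe(self.pursuer, self.evader),
            "evader": self._observe(self.evader, self.pursuer),
        }

    def step(self, pursuer_action, evader_action):
        # Both agents move simultaneously; capture means sharing a cell.
        self.pursuer = self._move(self.pursuer, pursuer_action)
        self.evader = self._move(self.evader, evader_action)
        self.steps += 1
        caught = self.pursuer == self.evader
        done = caught or self.steps >= self.max_steps
        # Assumed zero-sum reward: capture bonus, small per-step cost otherwise.
        r = 1.0 if caught else -0.01
        rewards = {"pursuer": r, "evader": -r}
        return self._observations(), rewards, done


def random_rollout(env):
    """Play one episode with uniformly random actions for both agents."""
    env.reset()
    done = False
    while not done:
        _, rewards, done = env.step(env.rng.randrange(5), env.rng.randrange(5))
    return env.steps, rewards
```

In a DRQN setup, each agent would feed its own sequence of observations into a separate recurrent Q-network and pick actions from the resulting Q-values; `random_rollout` only stands in for those policies so the sketch runs without a deep learning library.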