/Collaborate_Compete

About using MADDPG which is a variation of Deep Deterministic Policy Gradients to train a pair of agents to play tennis in Unit Environment

Primary LanguageJupyter Notebook

Stargazers