JingdiC/Distributional-Multi-Agent-Actor-Critic-Reinforcement-Learning-MADDPG-Tennis-Environment

The state-of-the-art in multi-agent Reinforcement Learning is the MADDPG algorithm which utilises DDPG actor-critic neural networks where each agent uses centralized critic training but decentralized actor execution, and is capable of learning either cooperative or competitive environments. This is demonstrated on the Unity Tennis Environment.

Jupyter Notebook

Stargazers

No one’s star this repository yet.