An implementation of MADDPG

1. Introduction

The experimental environment is a modified version of Waterworld based on MADRL.

The main features (different from MADRL) of the modified Waterworld environment are:

evaders and poisons now bounce at the wall obeying physical rules
sizes of the evaders, pursuers and poisons are now the same so that random actions will lead to average rewards around 0.
need exactly n_coop agents to catch food.

if scene rendering is enabled, recommend to install opencv through conda-forge.

The two agents need to cooperate to achieve the food for reward 10.

the average