Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
SigmaBM/robosumo-selfplay
Reproduction of self-play described in paper "Emergent Complexity via Multi-Agent Competition", adapted from PPO2 implementation in OpenAI baselines.
Python