This project is my MSc dissertation at the University of Edinburgh. I extend the Repeated Update Q-Learning heuristics proposed by Dr. Sherief to the Nature DQN framework.
The two games of Gathering and Wolfpack is orgionally proposed by Dr. Joel and Dr. Vinicius in the paper of Multi-agent Reinforcement Learning in Sequential Social Dilemmas