/HIVE

:honeybee: Neural network-driven multi-agent Prisoner's Dilemmas.

Primary LanguagePythonMIT LicenseMIT



WARNING: VERY OLD CODE

Most of the code was written in 2020. I have learned a lot since then and I am aware of the poor quality of the code.



HIVE GUI

HIVE

Table of contents

📜 Project description

Platform created to facilitate conducting spatial multi-agent iterated prisoner's dilemma experiments between groups controlled by RL algorithm incorporating artificial neural networks. Uses Double Dueling Deep Q-Network, a deep reinforcement learning algorithm.

Based on keras-rl2.
Simultaneous agent inspired by interleaved.py by Velochy.

🔨 Technologies used

  • Python 3.7.11
  • Pygame 1.9.6
  • Numpy 1.18.5
  • OpenAI gym 0.17.2
  • Matplotlib 3.3.0
  • Keras_rl2 1.0.4
  • Tensorflow 2.3.0
  • Theano 1.0.5

⬆️ Room for improvement

This is an old project of mine and it would certainly benefit from:

  • refactoring the code into better functions,
  • refactoring the code into different files,
  • getting rid of many anti-patterns,
  • having tests written for it.

📞 Contact

📈 Changelog:

📅 01.02.2021

  • no GUI version for faster experiments

📅 03.01.2021

  • board's nature is hexagonal now instead of rectangular
  • added optional controllable debug agent

📅 23.12.2020

  • the program is running in cycles now, X simulations in a row
  • added appending data to .csv files after each simulation

📅 10.12.2020

  • using DQN instead of DDPG (DDPG couldn't handle TFT)
  • UI changes

📅 03.12.2020

  • using DDPG instead of DQN now
  • agents have parameters that impact their actions

📅 21.11.2020

  • agents now exist on 2d 43x43 plane and interact when on the same tile
  • added visualisation done in pygame

📅 04.11.2020

  • more than two players can play now (interleaved training)
  • added 7 NPCs with popular PD strategies
  • players now include current episode step and type of the enemy in their observations

📅 28.10.2020

  • created stable, developable version of two neural nets playing PD against each other using:
    • custom OpenAI gym
    • custom keras-rl2 agent
    • custom keras-rl2 callbacks

📅 21.09.2020

  • created OpenAI custom gym environment
  • connected it to keras-rl2, doesn't work yet, need to create a proper network for it

📅 16.09.2020

  • created the repository
  • added prisoner's dilemma basic mechanism

👷 Author

🔓 License

MIT