Learn equilibrium strategies for the two-player game RoboCup using the following reinforcement learning algorithms:
- Q-learning
- Foe-Q learning
- Friend-Q learning
- Correlated-Q learning
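
The four algorithms share the same temporal-difference update and differ mainly in how they value the next state. The sketch below is illustrative only and is not taken from this repository: the function names, the table layout, and the `alpha` and `gamma` defaults are assumptions. It covers the two simplest cases, Q-learning (maximum over the agent's own actions) and Friend-Q (maximum over joint actions). Foe-Q and Correlated-Q replace that value with a minimax value and a correlated-equilibrium value respectively, each usually obtained from a linear program, and are omitted here.

```python
# Illustrative sketch; names and data layout are assumptions, not the repo's code.

def q_learning_value(q_row):
    """Q-learning: q_row[a] is the value of own action a; the opponent is ignored."""
    return max(q_row)


def friend_value(q_joint):
    """Friend-Q: q_joint[a][o] is indexed by (own action, opponent action).

    The opponent is assumed to cooperate, so take the best joint action.
    """
    return max(max(row) for row in q_joint)


def td_update(old_q, reward, next_value, alpha=0.1, gamma=0.9):
    """Shared update: blend the old estimate with reward + discounted next-state value."""
    return (1 - alpha) * old_q + alpha * (reward + gamma * next_value)


# Example: value the next state under each rule, then update one Q entry.
v_q = q_learning_value([1.0, 3.0, 2.0])
v_friend = friend_value([[0.0, 5.0], [2.0, 1.0]])
new_q = td_update(old_q=1.0, reward=0.0, next_value=2.0, alpha=0.5, gamma=0.5)
```

Only the next-state value function changes between learners in this sketch; the update rule itself stays the same.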

To create the environment:

    conda env create -f environment.yml

To activate the environment:

    conda activate ceq

To generate Q-values using the default Correlated-Q learning parameters:

    python main.py configs/ceq_learner.py results