ResMARL

Experimenting with different MARL algorithms trying to evaluate their resilience level.

Mainly building on top of the SSD domains repo: https://github.com/eugenevinitsky/sequential_social_dilemma_games

Specific additions, we are experimenting with:

Agents that can communicate over the environment.
Environments that are randomly perturbed over time.
Agents that try to minimize global and self confusion levels via messaging.

simply running train.py will run with 2 communicating agents on the cleanup environment. check default_args.py for changing the running parameters for train.py.

Execution Commands Examples

SRUN

srun --mem=20g -c12 --time=1:0:0 --gres=gpu:3,vmem:16g --killable example_srun_train_py.sh

SBATCH

sbatch example_sbatch_train_py.sh

sarah-keren/ResMARL

ResMARL

Execution Commands Examples

SRUN

SBATCH