Reinforcement learning on starcraft using pysc2.
I used vanissa policy gradient to solve this environments/
python beacon.py --training 1
The model will be saved every 100 epochs into the "logger" folder. The model name is "simple_save".
python beacon.py --model simple_save --replay 1
--replay 1 is used to save a replay of the game every 10 run.
python beacon.py --model simple_save --training 1
python -m pysc2.bin.play --replay <absolute-path>