pekaalto/sc2aibot

Unable to reproduce CollectMineralShards result

vwxyzjn opened this issue · 2 comments

Hi,

I was trying to reproduce the CollectMineralShards result you posted here:

image

by running

python run_agent.py --map_name CollectMineralShards --model_name my_mineral_shards_model

However, after 60k steps, the rewards still remain around 25

capture

I did not use the flag --n_envs 32. Would that be the cause of this irreproducibility?

Thank you.

Hi, actually the x-axis in this chart is number or episodes. In your screenshot can see that you have only 2100 episodes so far. So it's long way to go.

Other notes:
-The training is probably more robust with multiple environments even if run same number of episodes.
-There is lot of randomness involved in the learning process. You will get different training graph each run.

I see. Thank you!