Unable to reproduce CollectMineralShards result

Question

Unable to reproduce CollectMineralShards result

vwxyzjn opened this issue 7 years ago · 2 comments

Hi,

I was trying to reproduce the CollectMineralShards result you posted here:

by running

python run_agent.py --map_name CollectMineralShards --model_name my_mineral_shards_model

However, after 60k steps, the rewards still remain around 25

I did not use the flag --n_envs 32. Would that be the cause of this irreproducibility?

Thank you.

Answer 1 · 2017-11-25T06:34:25.000Z

Hi, actually the x-axis in this chart is number or episodes. In your screenshot can see that you have only 2100 episodes so far. So it's long way to go.

Other notes:
-The training is probably more robust with multiple environments even if run same number of episodes.
-There is lot of randomness involved in the learning process. You will get different training graph each run.

Answer 2 · 2017-11-27T06:41:53.000Z

I see. Thank you!