Some evaluation results are missing
muupan opened this issue · 2 comments
muupan commented
In scores.txt
of the current uploaded trained model, evaluation results at 55000000
and 56000000
are missing.
I don't know why and whether it can affect performance. I need to check.
muupan commented
I found that missing evaluation is caused by processes stuck in evaluate_performance()
. It is possible that some policies fail start to play Breakout, preventing episodes from being terminated. If so, it might be necessary to use epsilon-greedy-like action selection in addition to sampling from softmax policies in test runs.
muupan commented
It didn't occurred for Space Invaders. For Breakout we might need to force long episodes to finish.