dniku/rl-attention

Ensure that agent earns high reward on Colab

dniku opened this issue · 1 comment

dniku commented
Ensure that agent earns high reward on Colab

Apparently you just need to train for much longer to see any results, even on Pong: at least 5 million timesteps.
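
For a rough sense of what that budget means in practice, here is a minimal sketch that converts a timestep budget into a number of rollout/update cycles. The values `N_ENVS` and `N_STEPS` are hypothetical placeholders, not settings taken from this repo, and the helper `updates_for_budget` is made up for illustration only.

```python
def updates_for_budget(total_timesteps: int, n_envs: int, n_steps: int) -> int:
    """Return how many rollout/update cycles are needed to collect
    at least `total_timesteps` environment steps."""
    if total_timesteps <= 0 or n_envs <= 0 or n_steps <= 0:
        raise ValueError("all arguments must be positive")
    # Each cycle collects n_steps from each of n_envs parallel environments.
    steps_per_update = n_envs * n_steps
    # Integer ceiling division, so the budget is met rather than undershot.
    return -(-total_timesteps // steps_per_update)


TOTAL_TIMESTEPS = 5_000_000  # lower bound mentioned in this issue
N_ENVS = 8                   # hypothetical: number of parallel environments
N_STEPS = 128                # hypothetical: steps per environment per rollout

if __name__ == "__main__":
    n_updates = updates_for_budget(TOTAL_TIMESTEPS, N_ENVS, N_STEPS)
    print(f"{n_updates} updates needed for {TOTAL_TIMESTEPS} timesteps")
```

Multiplying the number of updates by the measured wall-clock time of one update on Colab gives an estimate of whether the run fits in a single session.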