google-deepmind/dqn_zoo

Atari Result Summary Figure

Closed this issue · 2 comments

Hi,

First, thanks for sharing the results.tar.gz.

I am interested in the summary figure provided. However, I cannot exactly reproduce the curves with the given CSV files. My implementation is as follows: for each algorithm (e.g., DQN),

  1. Compute the mean of normalized return over 5 random seeds for each environment.
  2. Plot the solid line with the median and the shaded region with maximal and minimal values over 57 environments using the results from Step 1.

I can basically reproduce the solid lines but the shaded region is not expected.

Your clarification would be very helpful!

Thanks
Ziniu

jqdm commented

Thanks for your interest.

That is one way to generate a similar plot. The method used for the summary plot in the README.md was chosen so that it roughly follows the one in the Rainbow paper, but for multiple seeds.

To generate the summary plot, the data was first smoothed with a moving average with window of size 10 as mentioned in the README.md. Given an agent, frame and seed, the median normalized return over environment is calculated. Then the DataFrame was passed to seaborn.lineplot() with estimator=numpy.median and ci=100, so the shaded area captures the variation over seeds.

If that's not clear, a Colab notebook showing how the summary plot was generated will be uploaded soon.

jqdm commented

Added Colab notebook for generating plots in 98bd61d.