Dear authors,
Thanks for your code! I found a possible error in building rtg in Atari.
I think Line 86 should be curr_traj_returns = stepwise_returns[start_index:i]
and Line 88 should be rtg_j = curr_traj_returns[j-start_index:i-start_index]
.
I'm not 100% sure about this.
Best,
Tao