PAPAG agent doesn't record all transitions it was trained on
nickjalbert commented
Currently, we record the transitions we train on only when an episode completes. However, training can also happen on transitions before an episode ends, and those transitions are never counted as part of training. As a result, the training metrics for PAPAG runs are off.
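To make the discrepancy concrete, here is a minimal sketch of the two counting strategies. It is not the PAPAG agent's actual code: `TrainEvent`, `count_at_episode_end`, and `count_at_train_time` are hypothetical names, and the batch sizes are made up for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TrainEvent:
    """One training step (hypothetical structure, for illustration only)."""
    num_transitions: int   # transitions consumed by this training step
    at_episode_end: bool   # True if the step ran when an episode completed


def count_at_episode_end(events: List[TrainEvent]) -> int:
    # Models the behavior described in this issue: only training that
    # coincides with an episode completing gets recorded.
    return sum(e.num_transitions for e in events if e.at_episode_end)


def count_at_train_time(events: List[TrainEvent]) -> int:
    # Possible fix: record transitions whenever a training step runs,
    # regardless of where the episode boundary falls.
    return sum(e.num_transitions for e in events)


# Two mid-episode training steps followed by one at the episode boundary.
events = [
    TrainEvent(num_transitions=32, at_episode_end=False),
    TrainEvent(num_transitions=32, at_episode_end=False),
    TrainEvent(num_transitions=32, at_episode_end=True),
]

recorded = count_at_episode_end(events)
actual = count_at_train_time(events)
print(f"recorded={recorded} actual={actual} missing={actual - recorded}")
```

In this made-up example the recorded count is 32 while 96 transitions were actually trained on, so moving the bookkeeping to the point where training happens would close the gap.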