agentos-project/agentos

PAPAG agent doesn't record all transitions it was trained on

Opened this issue · 0 comments

Currently, we only record the transitions we train on once we complete an episode. However, we might do training on transitions before an episode ends (and, thus, those transitions aren't counted as part of training). This results in the training metrics being off for papag runs.