google-deepmind/reverb

Initialize or Clean memory for on-policy

DawoonJang opened this issue · 1 comments

Hello everyone,

If I'm going to use this wonderful replay memory for on-policy algorithm, how do I reset or clean it to match action policy and learn policy?

Thank you.

Hey,

I'm afraid I don't really understand the question.