laboroai/border

[v0.0.6] Split functionality of ReplayBufferBase into two traits

Closed this issue · 0 comments

This issue is created to split functionality of a replay buffer into two traits, one for pushing samples from environments, the other for making batches to train agents.

In addition, this issue removes the trait bound of Batch: BatchBase in ReplayBufferBase. This modification allows replay buffers implementing this trait to emit batches that does not have the form (o, a, o', r, is_done). Instead, agents should specify trait bounds for the type of samples taken from replay buffers.