google-deepmind/dqn_zoo

Vectorized environments

Closed this issue · 2 comments

Hi,
Does this work with vectorized environments, for example ones created with gym.vector.AsyncVectorEnv?

Best,
Raymond

jqdm commented

Not out of the box, since DQN and its variants expect a single stream of experience. One could get something working reasonably quickly by adapting TransitionAccumulator so that it accepts a batch of time steps.
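To make the idea concrete, here is a rough, self-contained sketch of what a batched accumulator could look like. It is not code from dqn_zoo: the class name BatchedTransitionAccumulator, the Transition fields, and the step signature are all hypothetical and chosen for illustration. It assumes the caller passes the true terminal observation on a step where done is set and the reset observation on the following call, so any auto-reset behaviour of the vector environment has to be handled by the caller. It also treats every done as a true termination (discount 0), which is a simplification if the environment truncates episodes.

```python
from typing import Any, Iterator, List, NamedTuple, Optional, Sequence, Tuple


class Transition(NamedTuple):
    """Illustrative transition record (field names are assumptions)."""
    s_tm1: Any
    a_tm1: Any
    r_t: float
    discount_t: float
    s_t: Any


class BatchedTransitionAccumulator:
    """Hypothetical sketch: keeps one pending (s_tm1, a_tm1) pair per environment."""

    def __init__(self, num_envs: int):
        self._num_envs = num_envs
        self.reset()

    def reset(self) -> None:
        # None marks an environment that is at the start of an episode.
        self._prev: List[Optional[Tuple[Any, Any]]] = [None] * self._num_envs

    def step(
        self,
        observations: Sequence[Any],
        rewards: Sequence[float],
        dones: Sequence[bool],
        actions: Sequence[Any],
    ) -> Iterator[Transition]:
        """Yields at most one transition per environment.

        This is a generator, so it must be iterated for state to update.
        `actions` are the actions chosen in response to `observations`.
        """
        for i in range(self._num_envs):
            prev = self._prev[i]
            if prev is not None:
                s_tm1, a_tm1 = prev
                discount = 0.0 if dones[i] else 1.0
                yield Transition(
                    s_tm1, a_tm1, float(rewards[i]), discount, observations[i]
                )
            # After a terminal step the next observation starts a new episode,
            # so nothing is carried over for that environment.
            self._prev[i] = None if dones[i] else (observations[i], actions[i])
```

Each yielded transition could then be added to the replay buffer as usual, so the learner still sees individual transitions even though the actor steps several environments at once. For example, with two environments the first call yields nothing, and the second call yields one transition per environment:

```python
acc = BatchedTransitionAccumulator(num_envs=2)
list(acc.step(["a0", "b0"], [0.0, 0.0], [False, False], [1, 2]))
for transition in acc.step(["a1", "b1"], [1.0, 2.0], [False, True], [3, 4]):
    print(transition)
```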

Awesome, thanks @jqdm!