ikostrikov/pytorch-a2c-ppo-acktr-gail

Insert obs, action in storage (PPO)

Closed this issue · 0 comments

In storage.py, the obs is inserted at index (self.step + 1), but the action is inserted at index (self.step).
So when we make a batch, it looks like we get data (s_{t-1}, a_t, r_t, ...).
The obs and action seem to come from different time steps.
Can I get some intuition for this?
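
One way to read this indexing, as a minimal sketch rather than the repo's actual code: it assumes the obs buffer has one more slot than the action buffer, that slot 0 holds the observation seen before the first action, and that the batch drops the last obs. Under those assumptions the obs written at (self.step + 1) is the result of the action, and the batch pairs each action with the obs it was chosen from. `TinyRolloutBuffer` and `toy_env_step` are hypothetical names made up for illustration.

```python
class TinyRolloutBuffer:
    """Hypothetical, simplified stand-in for a rollout storage (not the repo's class)."""

    def __init__(self, num_steps, initial_obs):
        # One extra observation slot: obs[0] is what the agent sees before acting.
        self.obs = [None] * (num_steps + 1)
        self.actions = [None] * num_steps
        self.rewards = [None] * num_steps
        self.obs[0] = initial_obs
        self.num_steps = num_steps
        self.step = 0

    def insert(self, next_obs, action, reward):
        # The action was chosen from obs[step]; next_obs is what the env returned after it.
        self.obs[self.step + 1] = next_obs
        self.actions[self.step] = action
        self.rewards[self.step] = reward
        self.step = (self.step + 1) % self.num_steps

    def batch(self):
        # Dropping the last observation aligns obs[t] with actions[t] and rewards[t].
        return list(zip(self.obs[:-1], self.actions, self.rewards))


def toy_env_step(obs, action):
    # Hypothetical deterministic environment: next state is obs + action, reward is the action.
    return obs + action, float(action)


buffer = TinyRolloutBuffer(num_steps=3, initial_obs=0)
for t in range(3):
    current_obs = buffer.obs[buffer.step]  # the obs the action is based on
    action = t + 1  # stand-in for sampling from the policy given current_obs
    next_obs, reward = toy_env_step(current_obs, action)
    buffer.insert(next_obs, action, reward)

transitions = buffer.batch()
print(transitions)
```

In this sketch each tuple is (obs the action was taken from, action, reward), so the pairs would be (s_t, a_t, r_t) rather than (s_{t-1}, a_t, r_t). The final slot of the obs buffer is left over for bootstrapping the value of the last state.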