keiohta/tf2rl

Wrapping For Absorbing State in case of off-policy GAIL/GAIfO/VAIL (DAC)

HesNobi opened this issue · 0 comments

Hi,
According to the Discriminator-Actor-Critic (DAC) paper, using an off-policy RL algorithm such as SAC requires that absorbing states be handled explicitly and rewarded appropriately.
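
To make the request concrete, here is a minimal sketch of the kind of wrapping I have in mind. It is not tf2rl code: `pad_obs` and `wrap_episode` are hypothetical names, and the layout (one extra indicator dimension, a zero action in the absorbing state, and replacing the final next state with the absorbing state) is my reading of DAC, so treat those details as assumptions.

```python
import numpy as np


def pad_obs(obs, absorbing=False):
    """Append an indicator dimension: 0 for real states, 1 for the absorbing state.

    Hypothetical helper; the absorbing state is assumed to be all zeros
    in the original dimensions plus a 1 in the indicator dimension.
    """
    obs = np.asarray(obs, dtype=np.float32)
    if absorbing:
        return np.concatenate([np.zeros_like(obs), [1.0]]).astype(np.float32)
    return np.concatenate([obs, [0.0]]).astype(np.float32)


def wrap_episode(obs, acts, next_obs, terminated):
    """Convert one non-empty episode into absorbing-state-aware transitions.

    obs, acts, next_obs: lists of per-step arrays of equal length.
    terminated: True if the episode ended in a true terminal state,
                False if it was only cut off by a time limit.
    Returns a list of (state, action, next_state) tuples. No done flag is
    stored, because terminal transitions now lead into the absorbing state
    instead of ending the bootstrap.
    """
    transitions = []
    last = len(obs) - 1
    for t in range(len(obs)):
        # Assumption: the final transition of a terminated episode
        # points to the absorbing state rather than the raw next state.
        into_absorbing = terminated and t == last
        s = pad_obs(obs[t])
        s_next = pad_obs(next_obs[t], absorbing=into_absorbing)
        transitions.append((s, np.asarray(acts[t], dtype=np.float32), s_next))

    if terminated:
        # Absorbing -> absorbing self-loop, with a zero action (assumption),
        # so the discriminator can learn a reward for the absorbing state.
        s_abs = pad_obs(obs[0], absorbing=True)
        zero_act = np.zeros_like(np.asarray(acts[0], dtype=np.float32))
        transitions.append((s_abs, zero_act, s_abs))
    return transitions
```

The expert demonstrations would need the same treatment, so that the discriminator sees absorbing states from both the expert and the policy.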

Would you consider adding support for this?

Thanks for the open research and the code.

Related to: #127
