Wrapping For Absorbing State in case of off-policy GAIL/GAIfO/VAIL (DAC)
HesNobi opened this issue · 0 comments
HesNobi commented
Hi,
According to the Discriminator-Actor-Critic (DAC), in order to make use of off-policy RL (SAC), it is nessesery for the absorbing states to be processed and rewarded appropriately.
I am wondering if you would address this issue.
Thanks for the open research and the code.
Related to: #127