`absolute_action_mask` removed

Question

`absolute_action_mask` removed

Opened this issue 4 months ago · 4 comments

Hello,
I noticed that the new commits removed the absolute_action_mask argument.
I was wondering how is action padding being managed now?

Thanks

Answer 1 · 2024-05-28T08:37:18.000Z

Hey, I encountered the same problem, have you successfully ran any scripts of the new committed version

Answer 2 · 2024-06-06T20:41:36.000Z

I was wandering the same. Any update?

Answer 3 · 2024-06-07T17:25:24.000Z

We are also encountering blocking issues with this. Any additional detail regarding action padding would be really helpful. Thank you.

Answer 4 · 2024-06-07T21:29:43.000Z

Hi, sorry the late reply! Previously, the absolute_action_mask was used to figure out what to do with actions in a chunk that go past the point where the goal was achieved (either zero-ing them out or duplicating the last action so that the policy is trained to remain at the goal). Now, instead of using absolute_action_mask and trying to create neutral actions, the dataloader simply indicates when the goal (or the end of the trajectory, in case of no goals) has been reached using the key task_completed. It also updates action_pad_mask to indicate that any actions past the end of the goal should be considered padding.