ACT w/discrete action space

Question

dirkmcpherson opened this issue 9 months ago · 0 comments

Hi, I'm trying to use ACT as a baseline for an imitation learning / MBRL paper. The simplest environment I have uses a discrete action space. Is there any reason to believe ACT won't perform well if I make the standard continuous -> discrete changes (listed below)?

Continuous -> Discrete Changes

Action head outputs logits and actions are sampled from a categorical distribution
Cross entropy loss between logits and demonstrated action indexes
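
For concreteness, here is a minimal, framework-agnostic sketch of the two changes in plain Python. It is not taken from the ACT codebase: the function names (`log_softmax`, `chunk_cross_entropy`, `sample_action`, `greedy_action`) and the assumption that the policy emits one vector of logits per timestep of an action chunk are mine, for illustration only.

```python
import math
import random


def log_softmax(logits):
    """Numerically stable log-softmax over one timestep's logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def chunk_cross_entropy(chunk_logits, demo_actions):
    """Mean cross-entropy over an action chunk.

    chunk_logits: per-timestep logits, shape (chunk_len, num_actions)  (assumed layout)
    demo_actions: demonstrated action indices, length chunk_len
    """
    losses = [
        -log_softmax(logits)[action]
        for logits, action in zip(chunk_logits, demo_actions)
    ]
    return sum(losses) / len(losses)


def sample_action(logits, rng=random):
    """Sample an action index from the categorical distribution given by logits."""
    probs = [math.exp(lp) for lp in log_softmax(logits)]
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def greedy_action(logits):
    """Pick the highest-logit action (a deterministic alternative to sampling)."""
    return max(range(len(logits)), key=lambda i: logits[i])
```

In a real training loop the same quantities would normally come from the framework's built-in cross-entropy and categorical sampling routines; the sketch only spells out what they compute.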

Thanks for this great work!