ACT w/discrete action space
dirkmcpherson opened this issue · 0 comments
dirkmcpherson commented
Hi, I'm trying to use ACT as a baseline for an imitation learning / MBRL paper. The simplest environment I have uses a discrete action space. Is there any reason to believe that if I make the standard continuous -> discrete changes (listed below) ACT wont perform?
Discrete -> Continuous Changes
- Action head outputs logits and actions are sampled from a categorical distribution
- Cross entropy loss between logits and demonstrated action indexes
thanks for this great work!