google-research/robopianist

Observation space and action space

edwhu opened this issue · 1 comments

edwhu commented

Hi,

I have some questions on the observation and action space.

The observation space in the colab for PianoWithShadowHands has a goal key. What exactly is that?
Screen Shot 2023-05-25 at 4 24 51 PM

Next, could you give some details about the reduced action space? Is it helpful for learning? Does it limit the agent's final performance?

edwhu commented

Nevermind, I found the answers.

The goal key seems to be the remaining piano roll.

The reduced action space is mentioned in the paper.