go1 learns 3 legged policy

Question

go1 learns 3 legged policy

Opened this issue 10 months ago · 3 comments

Hello, when I try training the base policy for go2 python train.py --exptid 054-01-go1 --task go1 it learns to only use the front 2 legs and the back left leg. I tried training the a1 policy to see if it happens but that turned out fine and then I tried training go1 again and it learned 3 legs again. Is this something with the go1 config?

Answer 1 · 2023-10-02T06:42:16.000Z

Go1 support is not added yet, so training with --task go1 will have unexpected behaviors especially the terrains contain extreme cases since the max torque for go1 is 23.7Nm vs A1~33 Nm. You can try to change the urdf here instead of using --task go1 and see what happens.

Answer 2 · 2023-10-31T08:27:23.000Z

I am also interested in training the go1 policy. May I ask that have you successfully deployed this algorithm after modifying the urdf? By the way, the go1 urdf file in Isaac Sim seems to be the old version.

Answer 3 · 2023-12-12T06:24:08.000Z

some problem with you. When I try to train the base policy for other quals, it always learns to use the front leg and the back left leg. I tried to lower the difficulty of the terrain and reduce the rew_feet_edge, but they don't work. May I ask you that have you successfully deploy the alogorithm or fix the problem?