SAC and DDPG parameters for 'Hopper-v3' from Mujoco

Question

SAC and DDPG parameters for 'Hopper-v3' from Mujoco

Will-Nie opened this issue 2 years ago · 2 comments

HI, I recently ran 'Hopper-v3' from mujoco_py (license is free to obtain now) in autonomous-learning-library by adopting the default setting from pybullet 'HopperBulletEnv-v0' and plotted the results as follows: (SAC was the algo I used)

It seems the parameters can not be simply adopted from 'HopperBulletEnv-v0' to run 'Hopper-v3' since normally the score would reach 3000 after 1~2m env step. May I ask if you could make a set of parameters for Hopper-v3 for the algorithm sac and ddpg. Thanks in advance!

Answer 1 · 2024-02-11T20:11:31.000Z

Will try to re-tune this for the v4 mujoco environments.

Answer 2 · 2024-03-02T16:44:41.000Z

I updated the default hyperparameters in #312. For a 2 million step hopper training: