Fail to train of GAIL in Ant-v2 environment

Question

Fail to train of GAIL in Ant-v2 environment

seolhokim opened this issue 3 years ago · 3 comments

seolhokim commented 3 years ago

I trained your ppo first.

python examples/ppo_gym.py --env-name Ant-v2 --save-model-interval 100

After 500 episodes, I made trajectories.

python gail/save_expert_traj.py --model-path assets/learned_models/Ant-v2_ppo.p

Last, I ran gail.

python gail/gail_gym.py --env-name Ant-v2 --expert-traj-path assets/expert_traj/Ant-v2_expert_traj.p

I implemented Gail and Vail, but I failed to train it too.(but hopper worked well)

Any Ideas?

Answer 1 · 2022-02-03T04:49:14.000Z

Hi! Have you work it out?
It seems to be the problem of zfilter.

Answer 2 · 2022-02-06T05:21:47.000Z

No. I guess you doubt the training has failed because of standardization by zfilter, right? I checked but it was not the key to solving problem in my implementation.

Answer 3 · 2022-05-13T06:33:06.000Z

Any solution to this ? I am running in the same problem. The code works well for all the other MuJoCo I have tried but not for HalfCheetah