"PPO_Continuous.py" trained 1000 EP without effect

Question

"PPO_Continuous.py" trained 1000 EP without effect

Synmul opened this issue 4 years ago · 4 comments

[No changes have been made to the code.
tensorflow version is 2.2, will this affect it?

Answer 1 · 2020-07-01T09:44:32.000Z

Similar thing happened to me . I tried A2C continuous for pendulum without any change (except total episode was set to 3000) but reward is still varies between -1000 to -0 , it rarely goes to -0. So i tried A2C discrete without any change for cartpole and again it is too slow to train ..

Answer 2 · 2021-06-18T22:32:52.000Z

I received the same results - PPO continuous doesn't appear to learn anything. I'm running TF2.3, so it doesn't have to do with your version @Synmul

Answer 3 · 2022-02-17T01:27:57.000Z

Same here. No changes

Answer 4 · 2023-12-28T15:45:55.000Z

@Synmul did you close because it was fixed?