Opened this issue 4 years ago · 0 comments
I cannot solve the Pendulum continuous-control problem with your Chapter 07 implementation (PPO).
When the program finally exits, the problem is still unsolved. Could you please verify this and tell me how to reproduce your solution? Thanks.