hr0nix/omega

YAML config file for PPO training


Could you please provide the YAML config file for PPO training? Thanks a lot!

Hello!

The PPO implementation seems to have been broken by a recent refactoring. I will fix it and provide an example PPO config soon.

Keep in mind that in this repo I've been primarily focused on polishing the MuZero implementation; PPO has mostly been used to test things before doing anything more complicated. I've never tested it on anything harder than random_room_5x5.

I've fixed the PPO implementation and added a PPO config for the room_5x5 environment in 9d8c3c9.
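For anyone who wants a rough idea of what such a config involves before looking at the commit, below is a minimal sketch of a PPO training config with standard hyperparameters. All field names and values here are illustrative assumptions, not the actual schema used in this repo; see the config added in 9d8c3c9 for the real format.

```yaml
# Hypothetical PPO training config sketch -- field names are illustrative,
# not the actual schema from 9d8c3c9.
env:
  name: random_room_5x5        # small environment, good for sanity checks

train:
  algorithm: ppo
  total_steps: 1000000         # total environment steps
  num_envs: 16                 # parallel rollout environments
  rollout_length: 128          # steps collected per environment per update

ppo:
  learning_rate: 3.0e-4        # common default for PPO
  discount: 0.99               # reward discount factor (gamma)
  gae_lambda: 0.95             # lambda for generalized advantage estimation
  clip_eps: 0.2                # PPO clipping range
  entropy_coef: 0.01           # entropy bonus to encourage exploration
  value_loss_coef: 0.5         # weight of the value function loss
  num_epochs: 4                # optimization epochs per rollout batch
  num_minibatches: 4           # minibatches per epoch
```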

Thanks so much for your great work; it's really helpful.