Sampled MuZero implementation
matthiaskiller opened this issue · 1 comments
Search before asking
- I have searched the MuZero issues and found no similar feature requests.
Description
Hey,
I'm wondering if there is any intention to expand the code towards Sampled MuZero to make it work for continuous action spaces? According to Learning and Planning in Complex Action Spaces by Hubert et al.
Thanks!
Additional context
No response
Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios". This repository not only supports the AlphaZero algorithm but also extends support to MuZero and a series of related algorithms (including Sampled MuZero) and environments, which might meet your requirements. Best wishes.