werner-duvaud/muzero-general

Sampled MuZero implementation

matthiaskiller opened this issue · 1 comments

Search before asking

  • I have searched the MuZero issues and found no similar feature requests.

Description

Hey,

I'm wondering if there is any intention to expand the code towards Sampled MuZero to make it work for continuous action spaces? According to Learning and Planning in Complex Action Spaces by Hubert et al.

Thanks!

Additional context

No response

Hello, thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios". This repository not only supports the AlphaZero algorithm but also extends support to MuZero and a series of related algorithms (including Sampled MuZero) and environments, which might meet your requirements. Best wishes.