DHDev0/Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
PythonGPL-3.0
Issues
- 0
Dependency Bug
#12 opened by simwai - 1
- 3
reproducing the result on 2048
#9 opened by LinXueyuanStdio - 4
- 2
Does the code only work on CartPole?
#7 opened by echo1047 - 2
- 10
Questions about is_chance label assignment
#4 opened by timothijoe - 7
Default experiments are not converging
#3 opened by ipsec - 6
training loss: nan
#2 opened by ipsec - 2