muzero
There are 40 repositories under muzero topic.
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
huawei-noah/xingtian
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
johan-gras/MuZero
A structured implementation of MuZero
kaesve/muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
yenw/computer-go-dataset
datasets for computer go
Zeta36/muzero
A simple implementation of MuZero algorithm for connect4 game
rlglab/minizero
MiniZero: An AlphaZero and MuZero Training Framework
DHDev0/Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
Hwhitetooth/jax_muzero
An implementation of MuZero in JAX.
hr0nix/omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
tuero/muzero-cpp
A C++ pytorch implementation of MuZero
sail-sg/rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
DHDev0/Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
michaelnny/muzero
A PyTorch implementation of DeepMind's MuZero agent
bellerb/chappie.ai
Generalized AI to perform a multitude of tasks written in python3
DHDev0/Muzero
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
Itomigna2/Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
rystrauss/dopamax
Reinforcement learning in pure JAX.
jianzhnie/RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
benborder/drla
C++ Deep Reinforcement Learning Agent library
seawee1/efficientalphazero
AlphaZero for singleplayer environments implemented efficiently using Ray
hayashimasa/Robust_MuZero
A robust variant of MuZero
BIGBALLON/Toward-AGZ
Materials for AlphaGo
AntoniovanDijck/BlackJackRL
Deep Q Learning blackbox strategies for casino games
abrahamabel/Muzero-GDM_Pseudo_Code
A Notebook implementation of the Pseudocode from the original Muzero paper
benborder/drla-atari
Trains deep reinforcement learning agents in Atari environments via the DRLA library.
benborder/drla-sim
Trains a deep reinforcement learning agent in simulation testbed environments with the DRLA library.
mdhiebert/meta-minichess
Meta-learning experiments for the game of minichess and related rule variants.
Nebraskinator/SuperMarioBrosAI
MuZero for Super Mario Bros
souvikshanku/tic-tac-toe-zero
MuZero - tic-tac-toe
svenssona/muzero
Learning how muzero works
abrahamabel/GenesisZero
GenesisZERO : potential applications for MCTS agents with LLMs for Sequential decision-making
ChukwumaChukwuma/enyimba_ai
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction