mcts
There are 383 repositories under the mcts topic.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
s-casci/tinyzero
Easily train AlphaZero-like agents on any environment you want!
hrpan/tetris_mcts
MCTS project for Tetris
dylandjian/SuperGo
A student implementation of AlphaGo Zero
DataCanvasIO/Hypernets
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
QueensGambit/CrazyAra
A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:
vgarciasc/mcts-viz
Visualization of MCTS algorithm applied to Tic-tac-toe.
sungyubkim/Deep_RL_with_pytorch
A pytorch tutorial for DRL(Deep Reinforcement Learning)
initial-h/AlphaZero_Gomoku_MPI
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
thuxugang/doudizhu
An AI for the Chinese card game Dou Dizhu (斗地主)
kaesve/muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
akolishchak/doom-net-pytorch
Reinforcement learning models in ViZDoom environment
manyoso/allie
Allie: A UCI compliant chess engine
zjeffer/chess-deep-rl
Research project: create a chess engine using Deep Reinforcement Learning
CGLemon/Sayuri
AlphaZero based engine for the game of Go (圍棋/围棋).
PuYuuu/vehicle-interaction-decision-making
Decision-making for multiple vehicles at intersections, based on level-k game theory and MCTS
blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
lowrollr/turbozero
fast + parallel AlphaZero in JAX
Urinx/ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
YoujiaZhang/AlphaGo-Zero-Gobang
Meta-Zeta is a reinforcement-learning-based Gomoku (Gobang) model, built as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS decisions, and how the model learns through self-play. Source code + tutorial
rlglab/minizero
MiniZero: An AlphaZero and MuZero Training Framework
kobanium/Ray
Computer go engine using Monte-Carlo Tree Search (MCTS)
gorisanson/quoridor-ai
Quoridor AI based on Monte Carlo tree search
masouduut94/MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. This project uses the board game HEX as a platform to test different MCTS simulation strategies.
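The description above covers the standard MCTS loop: selection, expansion, simulation, and backpropagation. As an illustration (a minimal sketch, not code from any repository listed here), a UCT-based MCTS for a toy Nim game — players take 1-3 stones, whoever takes the last stone wins — might look like:

```python
import math
import random

class Node:
    def __init__(self, stones, player, parent=None, move=None):
        self.stones, self.player = stones, player  # player to move at this node
        self.parent, self.move = parent, move
        self.children, self.visits, self.wins = [], 0, 0.0

    def untried_moves(self):
        tried = {c.move for c in self.children}
        return [m for m in (1, 2, 3) if m <= self.stones and m not in tried]

def uct_select(node, c=1.4):
    # pick the child maximizing the UCT score: exploitation + exploration
    return max(node.children,
               key=lambda ch: ch.wins / ch.visits
                              + c * math.sqrt(math.log(node.visits) / ch.visits))

def mcts(stones, player, iters=3000):
    root = Node(stones, player)
    for _ in range(iters):
        node = root
        # 1. selection: descend while fully expanded and non-terminal
        while not node.untried_moves() and node.children:
            node = uct_select(node)
        # 2. expansion: add one untried child, if any
        moves = node.untried_moves()
        if moves:
            m = random.choice(moves)
            child = Node(node.stones - m, -node.player, node, m)
            node.children.append(child)
            node = child
        # 3. simulation: random playout; whoever takes the last stone wins
        s, p = node.stones, node.player
        while s > 0:
            s -= random.randint(1, min(3, s))
            p = -p
        winner = -p  # the player who just moved took the last stone
        # 4. backpropagation: credit wins from the perspective of the
        #    player who moved INTO each node (i.e. -node.player)
        while node:
            node.visits += 1
            if winner == -node.player:
                node.wins += 1
            node = node.parent
    # return the most-visited move from the root
    return max(root.children, key=lambda ch: ch.visits).move
```

With enough iterations the search converges to optimal Nim play: from 5 stones the only winning move is to take 1, leaving the opponent a multiple of 4.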
kobanium/TamaGo
Computer go engine using Monte-Carlo Tree Search written in Python3.
CGLemon/pyDLGO
A deep-learning-based GTP Go engine, with a KGS setup guide and algorithm tutorials.
yangboz/godpaper
:monkey_face: An AI chess-board-game framework with implementations in many programming languages.
TianHongZXY/CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)
coreylowman/synthesis
A rust implementation of AlphaZero algorithm
hr0nix/omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
michaelbzms/MonteCarloTreeSearch
A fast C++ implementation of Monte Carlo Tree Search, with abstract classes that users of the library can extend. As a demonstration, it is applied to the game of Quoridor.