mcts

There are 383 repositories under mcts topic.

  • hijkzzz/Awesome-LLM-Strawberry

    A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

  • suragnair/alpha-zero-general

    A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

    Language:Jupyter Notebook4k1131801.1k
  • junxiaosong/AlphaZero_Gomoku

    An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

    Language:Python3.4k102122973
  • opendilab/LightZero

    [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

    Language:Python1.2k12111129
  • chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving

    Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

  • s-casci/tinyzero

    Easily train AlphaZero-like agents on any environment you want!

    Language:Python4073017
  • hrpan/tetris_mcts

    MCTS project for Tetris

    Language:Python34412434
  • dylandjian/SuperGo

    A student implementation of Alpha Go Zero

    Language:Python27912861
  • DataCanvasIO/Hypernets

    A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

    Language:Python267192340
  • CrazyAra

    QueensGambit/CrazyAra

    A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:

    Language:Jupyter Notebook250215342
  • vgarciasc/mcts-viz

    Visualization of MCTS algorithm applied to Tic-tac-toe.

    Language:JavaScript2113111
  • sungyubkim/Deep_RL_with_pytorch

    A pytorch tutorial for DRL(Deep Reinforcement Learning)

    Language:Jupyter Notebook2087147
  • initial-h/AlphaZero_Gomoku_MPI

    An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

    Language:Python190104744
  • thuxugang/doudizhu

    AI斗地主

    Language:Python18491366
  • kaesve/muzero

    A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

    Language:Jupyter Notebook1568725
  • akolishchak/doom-net-pytorch

    Reinforcement learning models in ViZDoom environment

    Language:Python1328719
  • manyoso/allie

    Allie: A UCI compliant chess engine

    Language:C++105131520
  • zjeffer/chess-deep-rl

    Research project: create a chess engine using Deep Reinforcement Learning

    Language:Jupyter Notebook102808
  • Sayuri

    CGLemon/Sayuri

    AlphaZero based engine for the game of Go (圍棋/围棋).

    Language:C++9032210
  • PuYuuu/vehicle-interaction-decision-making

    The decision-making of multiple vehicles at intersection bases on level-k game and MCTS

    Language:C++891436
  • blanyal/alpha-zero

    AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

    Language:Python889228
  • lowrollr/turbozero

    fast + parallel AlphaZero in JAX

    Language:Python882127
  • Urinx/ReinforcementLearning

    Reinforcing Your Learning of Reinforcement Learning

    Language:Python889022
  • YoujiaZhang/AlphaGo-Zero-Gobang

    Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程

    Language:Python82288
  • rlglab/minizero

    MiniZero: An AlphaZero and MuZero Training Framework

    Language:C++736419
  • kobanium/Ray

    Computer go engine using Monte-Carlo Tree Search (MCTS)

    Language:C++72102281
  • gorisanson/quoridor-ai

    Quoridor AI based on Monte Carlo tree search

    Language:JavaScript65528
  • masouduut94/MCTS-agent-python

    Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.

    Language:Python65319
  • kobanium/TamaGo

    Computer go engine using Monte-Carlo Tree Search written in Python3.

    Language:Python6144011
  • CGLemon/pyDLGO

    基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。

    Language:Python59259
  • yangboz/godpaper

    :monkey_face: An AI chess-board-game framework(by many programming languages) implementations.

    Language:HTML4710118
  • TianHongZXY/CoRe

    [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)

    Language:Python44156
  • coreylowman/synthesis

    A rust implementation of AlphaZero algorithm

    Language:Rust43316
  • hr0nix/omega

    A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

    Language:Python39534
  • michaelbzms/MonteCarloTreeSearch

    A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it I apply it to the game of Quoridor.

    Language:C++38306