mcts

There are 358 repositories under mcts topic.

  • suragnair/alpha-zero-general

    A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

    Language:Jupyter Notebook3.7k1131761k
  • junxiaosong/AlphaZero_Gomoku

    An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

    Language:Python3.2k102119958
  • opendilab/LightZero

    [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

    Language:Python909128382
  • chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving

    Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

  • s-casci/tinyzero

    Easily train AlphaZero-like agents on any environment you want!

    Language:Python3963014
  • hrpan/tetris_mcts

    MCTS project for Tetris

    Language:Python33613434
  • dylandjian/SuperGo

    A student implementation of Alpha Go Zero

    Language:Python27612862
  • DataCanvasIO/Hypernets

    A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

    Language:Python260192340
  • CrazyAra

    QueensGambit/CrazyAra

    A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:

    Language:Jupyter Notebook232195342
  • sungyubkim/Deep_RL_with_pytorch

    A pytorch tutorial for DRL(Deep Reinforcement Learning)

    Language:Jupyter Notebook2007144
  • thuxugang/doudizhu

    AI斗地主

    Language:Python18291366
  • initial-h/AlphaZero_Gomoku_MPI

    An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

    Language:Python181104442
  • vgarciasc/mcts-viz

    Visualization of MCTS algorithm applied to Tic-tac-toe.

    Language:JavaScript1633111
  • kaesve/muzero

    A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

    Language:Jupyter Notebook1468723
  • akolishchak/doom-net-pytorch

    Reinforcement learning models in ViZDoom environment

    Language:Python1319720
  • manyoso/allie

    Allie: A UCI compliant chess engine

    Language:C++104131521
  • blanyal/alpha-zero

    AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

    Language:Python849228
  • Urinx/ReinforcementLearning

    Reinforcing Your Learning of Reinforcement Learning

    Language:Python849021
  • zjeffer/chess-deep-rl

    Research project: create a chess engine using Deep Reinforcement Learning

    Language:Jupyter Notebook76708
  • kobanium/Ray

    Computer go engine using Monte-Carlo Tree Search (MCTS)

    Language:C++69101881
  • lowrollr/turbozero

    fast + parallel AlphaZero in JAX

    Language:Python69284
  • YoujiaZhang/AlphaGo-Zero-Gobang

    Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程

    Language:Python66278
  • Sayuri

    CGLemon/Sayuri

    AlphaZero based engine for the game of Go (圍棋/围棋).

    Language:C++632207
  • masouduut94/MCTS-agent-python

    Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.

    Language:Python60319
  • gorisanson/quoridor-ai

    Quoridor AI based on Monte Carlo tree search

    Language:JavaScript58428
  • CGLemon/pyDLGO

    基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。

    Language:Python51258
  • kobanium/TamaGo

    Computer go engine using Monte-Carlo Tree Search written in Python3.

    Language:Python5044011
  • rlglab/minizero

    MiniZero: An AlphaZero and MuZero Training Framework

    Language:C++50539
  • yangboz/godpaper

    :monkey_face: An AI chess-board-game framework(by many programming languages) implementations.

    Language:HTML4710118
  • coreylowman/synthesis

    A rust implementation of AlphaZero algorithm

    Language:Rust39315
  • hr0nix/omega

    A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

    Language:Python36534
  • xuetf/AlphaZero_Gobang

    Deep Learning big homework of UCAS

    Language:Python362117
  • masterai-top/MasterAI-2.0-1vs1-NoLimit

    MasterAI is an AI poker dedicated to suport n-play (single- or multi-agent) Texas Hold'em imperfect-informatin games.

    Language:C++32329
  • michaelbzms/MonteCarloTreeSearch

    A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it I apply it to the game of Quoridor.

    Language:C++32305
  • TianHongZXY/CoRe

    [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models

    Language:Python32155