self-play

There are 53 repositories under self-play topic.

suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Jupyter Notebook3.7k 113 1761k
opendilab/DI-engine
OpenDILab Decision AI Engine
Language:Python2.7k 20 190346
opendilab/DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Language:Python1.2k 17 25110
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Language:Python936 11 8887
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python874 11 2675
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
Language:Python187 8 1121
ChuaCheowHuan/gym-continuousDoubleAuction
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
Language:Jupyter Notebook137 6 132
blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Language:Python85 9 228
seungeunrho/football-paris
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141
Language:Python58 4 312
Naton1/osrs-pvp-reinforcement-learning
Train a neural network to PvP in Old School RuneScape using reinforcement learning.
Language:Java55 3 019
dellalibera/td-gammon
TD-Gammon implementation
Language:Python40 2 313
dellalibera/gym-backgammon
Backgammon OpenAI Gym
Language:Python39 2 514
ShibiHe/Model-Free-Episodic-Control
This is the implementation of paper Model Free Episodic Control
Language:Python37 6 411
tobiasemrich/SchafkopfRL
AI agents for the bavarian card game Schafkopf trained with reinforcement learning
Language:Python35 5 25
Sebastian-Schuchmann/Self-Play-TicTacToe-AI-ML-Agents-
A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.
Language:C#32 6 09
sirmammingtonham/alphastone
Using self-play, MCTS, and a deep neural network to create a hearthstone ai player
Language:Python29 2 47
cestpasphoto/alpha-zero-general
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
Language:Python28 2 510
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Language:Python241
mbaske/ml-selfplay-fighter
Self-Play Boxing Match made with Unity Machine Learning Agents
Language:C#21 2 07
af1tang/convogym
A gym environment to train chatbots.
Language:Python20 4 03
cmubig/sorts
Code base for Social Robot Tree Search (SoRTS).
Language:Python20 1 03
backpropper/s2p
Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)
Language:Python16 3 12
peldszus/alpha-zero-general-lib
An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice
Language:Python12 3 03
AutumnCrocus/shadow_sim
Emulator and AI of Shadowverse
Language:Python11 1 11
Jackory/RPBT
Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)
Language:Python10 1 01
ChuaCheowHuan/PBT_MARL_watered_down
My attempt to reproduce a water down version of PBT (Population based training) for MARL (Multi-agent reinforcement learning) using DDPPO (Decentralized & distributed proximal policy optimization) from ray[rllib].
Language:Jupyter Notebook8 2 01
OneUpWallStreet/TD-Gammon
Implementation of TD Gammon algorithm by Gerald Tesauro at IBM's Thomas J. Watson Research Center in Python.
Language:Python7 1 01
e-dong/space-war-rl
Recreating Bill Seiler's 1985 version of Space War and training RL agents with Self-Play
Language:Python6 2 200
neoyung/connect-4
A reinforcement learning agent trained without prior human knowledge
Language:Jupyter Notebook6 1 05
navreeetkaur/AlphaGoZero
Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi
Language:Python5 4 00
novoselov-ab/ai-zero
Implementation of an AlphaGo Zero paper in one C++ header file without any dependencies
Language:C++5 2 04
cedrickchee/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python4 3 01
Galtvam/OthelloZero
A Smart Agent using reinforcement learning with CNN + MCTS to learn to play Othello/Reversi
Language:Python4 3 150
jianzhnie/RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
Language:Python4 4 0
Kajune/KODOKU
Multi-agent Self-Play Reinforcement Learning Library
Language:Python3 1 00
TARTRL/TARTRL
基于PyTorch的分布式强化学习框架
Language:Python3 2 00

self-play

suragnair/alpha-zero-general

opendilab/DI-engine

opendilab/DI-star

opendilab/LightZero

uclaml/SPIN

inspirai/TimeChamber

ChuaCheowHuan/gym-continuousDoubleAuction

blanyal/alpha-zero

seungeunrho/football-paris

Naton1/osrs-pvp-reinforcement-learning

dellalibera/td-gammon

dellalibera/gym-backgammon

ShibiHe/Model-Free-Episodic-Control

tobiasemrich/SchafkopfRL

Sebastian-Schuchmann/Self-Play-TicTacToe-AI-ML-Agents-

sirmammingtonham/alphastone

cestpasphoto/alpha-zero-general

uclaml/SPPO

mbaske/ml-selfplay-fighter

af1tang/convogym

cmubig/sorts

backpropper/s2p

peldszus/alpha-zero-general-lib

AutumnCrocus/shadow_sim

Jackory/RPBT

ChuaCheowHuan/PBT_MARL_watered_down

OneUpWallStreet/TD-Gammon

e-dong/space-war-rl

neoyung/connect-4

navreeetkaur/AlphaGoZero

novoselov-ab/ai-zero

cedrickchee/baselines

Galtvam/OthelloZero

jianzhnie/RLZero

Kajune/KODOKU

TARTRL/TARTRL