self-play
There are 53 repositories under self-play topic.
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
opendilab/DI-engine
OpenDILab Decision AI Engine
opendilab/DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
ChuaCheowHuan/gym-continuousDoubleAuction
A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.
blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
seungeunrho/football-paris
The exact codes used by the team "liveinparis" at the kaggle football competition ranked 6th/1141
Naton1/osrs-pvp-reinforcement-learning
Train a neural network to PvP in Old School RuneScape using reinforcement learning.
dellalibera/td-gammon
TD-Gammon implementation
dellalibera/gym-backgammon
Backgammon OpenAI Gym
ShibiHe/Model-Free-Episodic-Control
This is the implementation of paper Model Free Episodic Control
tobiasemrich/SchafkopfRL
AI agents for the bavarian card game Schafkopf trained with reinforcement learning
Sebastian-Schuchmann/Self-Play-TicTacToe-AI-ML-Agents-
A Self Play reinforcement learning Agent learns to play TicTacToe using the ML-Agents Framework in Unity.
sirmammingtonham/alphastone
Using self-play, MCTS, and a deep neural network to create a hearthstone ai player
cestpasphoto/alpha-zero-general
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
uclaml/SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
mbaske/ml-selfplay-fighter
Self-Play Boxing Match made with Unity Machine Learning Agents
af1tang/convogym
A gym environment to train chatbots.
cmubig/sorts
Code base for Social Robot Tree Search (SoRTS).
backpropper/s2p
Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)
peldszus/alpha-zero-general-lib
An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice
AutumnCrocus/shadow_sim
Emulator and AI of Shadowverse
Jackory/RPBT
Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)
ChuaCheowHuan/PBT_MARL_watered_down
My attempt to reproduce a water down version of PBT (Population based training) for MARL (Multi-agent reinforcement learning) using DDPPO (Decentralized & distributed proximal policy optimization) from ray[rllib].
OneUpWallStreet/TD-Gammon
Implementation of TD Gammon algorithm by Gerald Tesauro at IBM's Thomas J. Watson Research Center in Python.
e-dong/space-war-rl
Recreating Bill Seiler's 1985 version of Space War and training RL agents with Self-Play
neoyung/connect-4
A reinforcement learning agent trained without prior human knowledge
navreeetkaur/AlphaGoZero
Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi
novoselov-ab/ai-zero
Implementation of an AlphaGo Zero paper in one C++ header file without any dependencies
cedrickchee/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Galtvam/OthelloZero
A Smart Agent using reinforcement learning with CNN + MCTS to learn to play Othello/Reversi
jianzhnie/RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
Kajune/KODOKU
Multi-agent Self-Play Reinforcement Learning Library
TARTRL/TARTRL
基于PyTorch的分布式强化学习框架