Gobang

This is a PyTorch-based implementation of AlphaGo Zero algorithm for gobang game. Also, provide an OpenAI Gym style environment interface for gobang game

Environment

Python 3.*
PyTorch 0.3.*

Usage

Training

python train.py

Result

6*6 board, n_in_row is 4

training time

2~3 hours
results

9:1 versus pure monte carlo tree search algorithm(with 10 times sample counts)

TODO

Add interface for human player
Implement render_ interface for gobang environment so as to provide an UI

Thidtc/gobang

Gobang

Environment

Usage

Result

6*6 board, n_in_row is 4

TODO