A multi-threaded implementation of AlphaZero
- Free-style Gomoku
- Tree/Root Parallelization with Virtual Loss/LibTorch
- Gomoku and MCTS are written in C++
- SWIG wrap C++ extension
- Update 2019.7.10: support Ubuntu and Windows
Edit config.py
- Python 3.7
- PyGame 1.9
- PyTorch 1.1
- LibTorch 1.1
- MSVC14.0/GCC6.0+
- CMake 3.8+
- SWIG 3.0.12
# Add LibTorch/SWIG to environment variable $PATH
# Compile Python extension
mkdir build
cd build
cmake .. -DCMAKE_PREFIX_PATH=path/to/libtorch -DCMAKE_BUILD_TYPE=Release
cmake --build
# Run
cd ../test
python learner_test.py train # train model
python learner_test.py play # play with human
Trained 2 days on GTX1070
Link: https://pan.baidu.com/s/1c2Otxdl7VWFEXul-FyXaJA Password: e5y4
- Mastering the Game of Go without Human Knowledge
- Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
- Parallel Monte-Carlo Tree Search
- An Analysis of Virtual Loss in Parallel MCTS
- A Lock-free Multithreaded Monte-Carlo Tree Search Algorithm
- github.com/suragnair/alpha-zero-general