blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
PythonMIT
Issues
- 0
Log directory for Tensorboard
#2 opened by ayman803 - 0
Policy loss too high
#1 opened by aletote