Deep Learning and the Game of Go

Code, sample chapters and notebooks for the book "Deep Learning and the Game of Go" (Manning), available for early access here.

Playable demos

The book is all about getting you started to create your own bots. To make the experience more fun and interactive, we built and deployed several bots showcasing the techniques of the respective chapter. So far you can play:

Chapter 4 A tiny tree search bot on a 5x5 board here.
Chapter 7 A full 19x19 bot powered by a deep neural network trained to predict human moves here
Chapter 9 A bot playing on a 9x9 board, using policy gradients, here

These demos will be available in the liveBook version of the book as well.

Toward deep learning: a machine learning introduction
Go as a machine learning problem
Implementing your first Go bot
Playing games with tree search
Getting started with neural networks
Designing a neural network for Go data
Learning from data: a deep learning bot
Deploying bots in the wild
Enter deep reinforcement learning
Reinforcement learning with policy gradients
Reinforcement learning with value methods
Reinforcement learning with actor-critic methods
AlphaGo: Combining approaches
AlphaGoZero and AlphaZero: Combining approaches

Appendices

A. Mathematical foundations with Python
B. The backpropagation algorithm
C. Go programs and servers
D. Training and deploying bots using Amazon Web Services
E. Submitting a bot to the Online Go Server (OGS)

Welcome

When AlphaGo hit the news in early 2016, we were extremely excited about this groundbreaking advancement in computer Go. At the time, it was largely conjectured that human-level artificial intelligence for the game of Go was at least 10 years in the future. We followed the games meticulously and didn’t shy away from waking up early or staying up late to watch the broadcasted games live. Indeed, we had good company — millions of people around the globe were captivated by the games against Fan Hui, Lee Sedol, and later, Ke Jie and others.

Shortly after the emergence of AlphaGo, we picked up work on a little open source library we coined BetaGo to see if we could implement some of the core mechanisms running AlphaGo ourselves. The idea of BetaGo was to bring AlphaGo to regular developers. While we were realistic enough to accept that we didn’t have the resources (time, computing power, or intelligence) to compete with Deepmind's incredible achievement, it has been a lot of fun to create our own Go bot. Since then, we’ve had the privilege to speak about computer Go at quite a few occasions.

We strongly believe that the principles underpinning AlphaGo can be taught to a general software engineering audience in a practical manner. Enjoyment and understanding of Go comes from playing it and experimenting with it. It can be argued that the same holds true for machine learning, or any other discipline for that matter. In this book, we hope to use the game of Go as a gateway to the exciting world of deep learning. We start with the classical AI principles for board games. Right from the start, you’ll have a working Go AI that you can play against—although it will be very weak at first. Then we’ll introduce some techniques from the world of deep learning and reinforcement learning. As you learn each technique, you can incorporate it into your Go AI and watch it improve.

If you share some of our enthusiasm for either Go or machine learning, hopefully both, at the end of this book, we’ve done our job. If, on top of that, you know how to build and ship a Go bot and run your own experiments, many other interesting artificial intelligence applications will be accessible to you as well. Enjoy the ride!

kschelonka/deep_learning_and_the_game_of_go

Deep Learning and the Game of Go

Playable demos

Table of Contents

Welcome