introduction

AlphaGo is the first computer program to defeat a professional human Go player, the first to defeat a Go world champion, and is arguably the strongest Go player in history.

Deepmind

What is in this repo?

This is a refactor of AppliedDataSciencePartners/DeepReinforcementLearning part of the article How to build your own AlphaZero AI using Python and Keras

The purpose of the refactor

Make experimentation easier
Add more documentation
Explain the algorithms
Improve performance

How to run it?

Install Python 3.6.*
1. Do not install a newer version it will not compile
Install libraries in requirements.txt
Execute main.py

Road map

Improve documentation
Fix jupyter notebook
Add unit test
Add profiler instrumentation
1. Learning vs MCTS
2. Memory usage
Add Dependency Injector
Add simple games to make it easier to understand
Change logger to Resource provider
1. Make logger less intrusive
2. Improve log data
3. Consider the use of sampling to reduce the size
Add stats to a DB (remove some logs)
Update to tensorflow 2
Add capacity to migrate models to TensorFlow.js
Add online demos
1. Use TensorFlow.js models in online games
2. Player VS NPC
3. NPC VS NPC
4. Interactive learning process
Config
1. Make config adjustable by game
  1. Parameters can vary from game to game, having all in one config makes it difficult for experimentation
2. Use defaults

luiskarlos/DeepReinforcementLearning

introduction

What is in this repo?

The purpose of the refactor

How to run it?

Road map

Wondering

Resource Links