dhruv-aggarwal/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Language: Python. License: Apache-2.0.

Reinforcement Learning: An Introduction

Python code for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

If anything in the code is unclear or you want to report a bug, please open an issue instead of emailing me directly.

Contents

Chapter 1

Tic-Tac-Toe

Chapter 2

Chapter 3

Chapter 4

Chapter 5

Chapter 6

Chapter 7

Figure 7.2: Performance of n-step TD methods on 19-state random walk
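For orientation, here is a minimal sketch of n-step TD prediction on the 19-state random walk. It is not the code from this repository. It assumes the setup from the book: states 1 to 19, terminal states on both ends, start in the middle, reward -1 on the left exit and +1 on the right exit, no discounting. All names (`n_step_td_episode`, `run`, `rms_error`) are made up for this illustration.

```python
import numpy as np

N_STATES = 19                      # non-terminal states 1..19
START = 10                         # middle state
LEFT, RIGHT = 0, N_STATES + 1      # terminal states

# True state values under the random policy: (s - 10) / 10, terminals fixed at 0.
TRUE_VALUES = np.arange(-20, 22, 2) / 20.0
TRUE_VALUES[LEFT] = TRUE_VALUES[RIGHT] = 0.0


def n_step_td_episode(values, n, alpha, rng, gamma=1.0):
    """Run one episode and update `values` in place with n-step TD."""
    states = [START]
    rewards = [0.0]                # dummy entry so rewards[t] pairs with states[t]
    T = float("inf")               # episode length, unknown until termination
    t = 0
    while True:
        if t < T:
            # Random policy: step left or right with equal probability.
            next_state = states[t] + (1 if rng.random() < 0.5 else -1)
            if next_state == LEFT:
                reward = -1.0
            elif next_state == RIGHT:
                reward = 1.0
            else:
                reward = 0.0
            states.append(next_state)
            rewards.append(reward)
            if next_state in (LEFT, RIGHT):
                T = t + 1
        tau = t - n + 1            # time step whose state estimate is updated
        if tau >= 0:
            end = min(tau + n, T)
            G = sum(gamma ** (i - tau - 1) * rewards[i]
                    for i in range(tau + 1, end + 1))
            if tau + n < T:        # bootstrap from the current estimate
                G += gamma ** n * values[states[tau + n]]
            values[states[tau]] += alpha * (G - values[states[tau]])
        if tau == T - 1:
            break
        t += 1


def rms_error(values):
    """RMS error over the non-terminal states."""
    return float(np.sqrt(np.mean((values[1:-1] - TRUE_VALUES[1:-1]) ** 2)))


def run(n, alpha, episodes, seed=0):
    """Train from all-zero estimates and return the value array."""
    rng = np.random.default_rng(seed)
    values = np.zeros(N_STATES + 2)
    for _ in range(episodes):
        n_step_td_episode(values, n, alpha, rng)
    return values


if __name__ == "__main__":
    for n in (1, 4, 16):
        print(n, rms_error(run(n=n, alpha=0.2, episodes=10)))
```

The figure in the book averages this kind of error over the first 10 episodes and many runs for a grid of `n` and `alpha` values. The sketch only shows a single run per setting.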

Chapter 8

Chapter 9

Chapter 10

Chapter 11

Chapter 12

Chapter 13

Environment

Python 3.6
numpy
matplotlib
seaborn
tqdm
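A quick way to check that the packages listed above are importable before running a script is sketched below. The helper name `missing_packages` is hypothetical and is not part of this repository. It relies only on the standard-library `importlib.util.find_spec`.

```python
import importlib.util

# Third-party packages listed in the Environment section.
REQUIRED = ("numpy", "matplotlib", "seaborn", "tqdm")


def missing_packages(names=REQUIRED):
    """Return the top-level package names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed packages are importable.")
```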

Usage

All files are self-contained, so any script can be run directly:

python any_file_you_want.py

Contribution

If you want to contribute missing examples or fix bugs, feel free to open an issue or make a pull request.

The following figures/examples are missing:

Figure 12.14: The effect of λ