Reinforcement-Learning: An introduction

Overview

This repository contains code, exercises and solutions for popular Reinforcement Learning algorithms. These are meant to serve as a learning tool to complement the theoretical materials from:

Environment and Libraries

Python3
Jupyter notebook
Numpy
Matplotlib
Six
Seaborn

All code is written in Python 3 and uses RL environments from OpenAI Gym. Advanced techniques use Tensorflow for neural network implementations.

TODO

RL Course by David Silver

1. Lectures

2. Reinforcement Learning Assignment: Easy21

Reinforcement Learning: An Introduction

Notebooks for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

Chapter 1: The Reinforcement Learning Problem

Chapter 2: Multi-arm Bandits

Chapter 3: Finite Markov Decision Processes

Chapter 4: Dynamic Programming

Chapter 5: Monte Carlo Methods

Chapter 6: Temporal-Difference Learning

Chapter 7: Multi-step Bootstrapping

Chapter 8: Planning and Learning with Tabular Methods

Chapter 9: On-policy Prediction with Approximation

Chapter 10: On-policy Control with Approximation

Chapter 11: Off-policy Methods with Approximation

Chapter 12: Eligibility Traces

Chapter 13: Policy Gradient Methods

Extra

andersy005/reinforcement-learning-an-introduction