/reinforcement-learning-an-introduction

Implementation of Reinforcement Learning Algorithms: Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book & David Silver's course.

Primary LanguageJupyter NotebookMIT LicenseMIT

Reinforcement-Learning: An introduction

Overview

This repository contains code, exercises and solutions for popular Reinforcement Learning algorithms. These are meant to serve as a learning tool to complement the theoretical materials from:

Environment and Libraries

  • Python3
  • Jupyter notebook
  • Numpy
  • Matplotlib
  • Six
  • Seaborn

All code is written in Python 3 and uses RL environments from OpenAI Gym. Advanced techniques use Tensorflow for neural network implementations.


TODO

RL Course by David Silver

1. Lectures

2. Reinforcement Learning Assignment: Easy21


Reinforcement Learning: An Introduction

Notebooks for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

Contents

Chapter 1: The Reinforcement Learning Problem

Chapter 2: Multi-arm Bandits

Chapter 3: Finite Markov Decision Processes

Chapter 4: Dynamic Programming

Chapter 5: Monte Carlo Methods

Chapter 6: Temporal-Difference Learning

Chapter 7: Multi-step Bootstrapping

Chapter 8: Planning and Learning with Tabular Methods

Chapter 9: On-policy Prediction with Approximation

Chapter 10: On-policy Control with Approximation

Chapter 11: Off-policy Methods with Approximation

Chapter 12: Eligibility Traces

Chapter 13: Policy Gradient Methods

Extra