Excercises and (my) solutions

for

Reinforcement Learning: An Introduction (2nd edition)

Have other solutions or think mine are wrong? Share your thoughts (open an issue)!

Chapter 1 - Introduction

Chapter 2 - Multi-armed Bandits

Chapter 3 - Finite Markov Decision Processes

Chapter 4 - Dynamic Programming

Chapter 5 - Monte Carlo Methods

Chapter 6 - Temporal-Di↵erence Learning

Chapter 7 - n-step Bootstrapping

Chapter 8 - Planning and Learning with Tabular Methods