for
The book is available at: https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf
Have a different solution, or think one of mine is wrong? Share your thoughts by opening an issue!
Chapter 1 - Introduction
Chapter 2 - Multi-armed Bandits
Chapter 3 - Finite Markov Decision Processes
Chapter 4 - Dynamic Programming
Chapter 5 - Monte Carlo Methods
Chapter 6 - Temporal-Difference Learning
Chapter 7 - n-step Bootstrapping
Chapter 8 - Planning and Learning with Tabular Methods