/hands-on-rl

The Hands-on Reinforcement Learning course 🚀

From zero to HERO 🦸🏻🦸🏽

Out of intense complexities, intense simplicities emerge

--Winston Churchill

Welcome ❤️

Welcome to my step-by-step, hands-on course that will take you from basic reinforcement learning to cutting-edge deep RL.

We will start with a short intro to what RL is, what it is used for, and what the landscape of current RL algorithms looks like.

Then, in each following chapter, we will solve a different problem of increasing difficulty:

  • ๐Ÿ† easy
  • ๐Ÿ†๐Ÿ† medium
  • ๐Ÿ†๐Ÿ†๐Ÿ† hard

Ultimately, the most complex RL problems involve a mixture of reinforcement learning algorithms, optimizations and Deep Learning techniques.

You do not need to know deep learning (DL) to follow along with this course.

I will give you enough context to get familiar with the DL philosophy and understand how it becomes a crucial ingredient in modern reinforcement learning.

Contents

  1. Introduction to Reinforcement Learning
  2. Q-learning to drive a taxi 🏆
  3. SARSA to beat gravity 🏆

00. Intro to reinforcement learning

👉 Read in datamachines 👉 Read in Towards Data Science

  • What is a Reinforcement Learning problem? 🤔
  • Policies 👮🏽 and value functions.
  • How to generate the training data? 📊
  • Python boilerplate code. 🐍
  • Recap ✨
  • Homework 📚
  • What's next? ❤️
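
To give a feel for the topics listed above (policies and generating training data), here is a minimal sketch of the agent-environment loop. The `CorridorEnv` class, its reward scheme, and the helper names are hypothetical stand-ins invented for this illustration; they are not the course's own code or environments.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: walk right along a corridor to reach the goal."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Action 0 moves left, action 1 moves right; walls clamp the position.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 0.0 if done else -1.0  # -1 per step until the goal is reached
        return self.state, reward, done


def random_policy(state, rng):
    """A policy maps a state to an action; this one ignores the state."""
    return rng.choice([0, 1])


def collect_episode(env, policy, rng, max_steps=100):
    """Run one episode and return its (s, a, r, s_next, done) transitions."""
    transitions = []
    state = env.reset()
    for _ in range(max_steps):
        action = policy(state, rng)
        next_state, reward, done = env.step(action)
        transitions.append((state, action, reward, next_state, done))
        state = next_state
        if done:
            break
    return transitions


rng = random.Random(0)
episode = collect_episode(CorridorEnv(), random_policy, rng)
total_reward = sum(r for _, _, r, _, _ in episode)
print(f"steps: {len(episode)}, total reward: {total_reward}")
```

The list of transitions is the "training data" in this sketch: the agent produces it by acting, rather than reading it from a fixed dataset.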

01. Q-Learning to drive a taxi 🏆

👉 Read in datamachines 👉 Read in Towards Data Science

  • The taxi driving problem 🚕
  • Environment, actions, states, rewards
  • Random agent baseline 🤖🍷
  • Q-learning agent 🤖🧠
  • Hyper-parameter tuning 🎛️
  • Recap ✨
  • Homework 📚
  • What's next? ❤️
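
As a rough preview of the Q-learning agent mentioned above, the sketch below applies the tabular Q-learning update to a tiny corridor problem. The corridor dynamics, the reward of -1 per step, and the hyper-parameter values (`alpha`, `gamma`, `epsilon`, number of episodes) are assumptions chosen for illustration; the chapter itself works on the taxi problem, which is not reproduced here.

```python
import random

N_STATES, N_ACTIONS, GOAL = 6, 2, 5  # hypothetical corridor: states 0..5, goal at 5


def step(state, action):
    """Toy dynamics: action 1 moves right, action 0 moves left; -1 reward per step."""
    next_state = min(max(state + (1 if action == 1 else -1), 0), GOAL)
    done = next_state == GOAL
    return next_state, (0.0 if done else -1.0), done


def epsilon_greedy(q, state, epsilon, rng):
    """Explore with probability epsilon, otherwise act greedily (random tie-break)."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in range(N_ACTIONS) if q[state][a] == best])


def train_q_learning(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done, steps = 0, False, 0
        while not done and steps < 200:
            action = epsilon_greedy(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Off-policy target: bootstrap from the best action in the next state.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            steps += 1
    return q


q_table = train_q_learning()
greedy_policy = [max(range(N_ACTIONS), key=lambda a: q_table[s][a]) for s in range(GOAL)]
print(greedy_policy)
```

Changing `alpha`, `gamma`, or `epsilon` in `train_q_learning` is a small-scale version of the hyper-parameter tuning listed in this chapter.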

02. SARSA to beat gravity 🏆

👉 Read in datamachines 👉 Read in Towards Data Science

  • The Mountain car problem 🚃
  • Environment, actions, states, rewards
  • Random agent baseline 🚃🍷
  • SARSA agent 🚃🧠
  • Take a pause and breathe ⏸🧘
  • Recap ✨
  • Homework 📚
  • What's next? ❤️
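
For readers curious about how SARSA differs from Q-learning, here is a small sketch of the SARSA update combined with a simple discretization of a continuous state. The state bounds, the bin counts, the three-action layout, and the hand-made transition are all assumptions made for this example; check the actual environment used in the chapter before relying on any of these values.

```python
import random
from collections import defaultdict

# Assumed bounds for a Mountain-Car-like state (position, velocity).
LOW = (-1.2, -0.07)
HIGH = (0.6, 0.07)
BINS = (20, 20)  # assumed grid resolution
N_ACTIONS = 3    # assumed: push left, do nothing, push right


def discretize(observation):
    """Map a continuous (position, velocity) pair to a tuple of bin indices."""
    indices = []
    for value, low, high, bins in zip(observation, LOW, HIGH, BINS):
        ratio = (value - low) / (high - low)
        indices.append(min(bins - 1, max(0, int(ratio * bins))))
    return tuple(indices)


def epsilon_greedy(q, state, epsilon, rng):
    """Explore with probability epsilon, otherwise pick the highest-valued action."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    return max(range(N_ACTIONS), key=lambda a: q[state][a])


def sarsa_update(q, state, action, reward, next_state, next_action, done,
                 alpha=0.1, gamma=0.99):
    """On-policy update: bootstrap from the action actually chosen next."""
    target = reward + (0.0 if done else gamma * q[next_state][next_action])
    q[state][action] += alpha * (target - q[state][action])


rng = random.Random(0)
q = defaultdict(lambda: [0.0] * N_ACTIONS)

# One hand-made transition, standing in for a real environment step.
s = discretize((-0.5, 0.0))
a = epsilon_greedy(q, s, epsilon=0.1, rng=rng)
s_next = discretize((-0.49, 0.01))
a_next = epsilon_greedy(q, s_next, epsilon=0.1, rng=rng)
sarsa_update(q, s, a, reward=-1.0, next_state=s_next, next_action=a_next, done=False)
print(s, a, q[s])
```

The only difference from the Q-learning target is that SARSA uses `q[next_state][next_action]`, the value of the action the agent will actually take, instead of the maximum over all next actions.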