Reinforcement Learning

This is the repository for the 2023 Seasons of Code Project "Hands on Reinforcement Learning".

Mentors

  • Ashwin Abraham
  • Jujhaar Singh
  • Prerak Contractor
  • Sankalp Bhamare

Instructions for Mentees

The assignments for each week will be posted here. Submit your solutions as a pull request; the best solution for each assignment will be accepted.

Your solution should ideally be in the form of a Jupyter Notebook. Some assignments will build off each other to form the project.

Schedule

| Week | Reading | Assignment |
|------|---------|------------|
| Week 1 | Dynamic Programming (S&B Chapter 4); Introduction to Deep RL (Grokking Chapter 1) | Link |
| Week 2 | Monte Carlo Methods (S&B Chapter 5) | Link |
| Week 3 | Temporal Difference Learning and Q Learning (S&B Chapter 6); Introduction to Value Based Deep RL (Grokking Chapter 8) | |
| Week 4 | Eligibility Traces (S&B Chapter 7) | |
| Week 5 | Tabular Methods (S&B Chapter 8); Deep Q Learning Networks (Grokking Chapters 9, 10) | |
| Week 6 | Approximate Solution Methods (S&B Chapters 9, 11) | |
| Week 7 | Applications (S&B Chapter 14); Advanced Methods (Grokking Chapters 11, 12) | |

Resources

Many resources are linked here, but we will mainly follow the first three and use the rest as needed. The notes will be updated regularly with each week's material.