/Stanford-CS-234-RL-2022

Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.

Primary LanguagePythonMIT LicenseMIT

Stanford-CS-234-RL-2022

Solutions to the Stanford CS:234 Reinforcement Learning 2022 course assignments.

Course website: https://web.stanford.edu/class/cs234/

Assignment 1

Frozen Lake Markov Decision Process using Value Iteration and Policy Iterasion

         Policy Iteration                   Value Iteration          

Assignment 2

Tabular Q Learning and Deep Q Learning

Learning Curve on the test environment:

Assignment 3

Policy Gradient Methods and REINFORCE

Learning Curve of the REINFORCE algorithm on CartPole-v0:

Assignment 4

Aplication of Bandit Algorithms in the medical setting

Comparison of different Bandit Algorithms:

Assignment 5

Aplication of Upper Confidence Bandit in personalized Recomendation Systems

Comparison of different arm update strategies: