This repository implements Q-Learning, a Reinforcement Learning algorithm, and applies it to the Cliff Walking problem defined by Sutton and Barto. It is written in Python and uses the OpenAI Gym Cliff Walking environment.
The notebook serves as a guide for studying and understanding the solution.
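
For orientation, here is a minimal sketch of tabular Q-Learning on a Cliff Walking grid. It is not the notebook's code. To stay self-contained it replaces the Gym environment with a small hand-written `step` function. The names `step`, `train`, and `greedy_path` are hypothetical, and so are the hyperparameters (`alpha=0.5`, `gamma=1.0`, `epsilon=0.1`, 2000 episodes). The layout assumed is the 4x12 grid from Sutton and Barto: start at the bottom-left, goal at the bottom-right, cliff in between, reward -1 per move and -100 for stepping into the cliff, which sends the agent back to the start.

```python
import random

ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
CLIFF = {(3, c) for c in range(1, 11)}          # bottom row between start and goal
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]    # up, right, down, left


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    dr, dc = ACTIONS[action]
    # Moves that would leave the grid keep the agent at the border.
    row = min(max(state[0] + dr, 0), ROWS - 1)
    col = min(max(state[1] + dc, 0), COLS - 1)
    if (row, col) in CLIFF:
        return START, -100, False               # fall: big penalty, back to start
    return (row, col), -1, (row, col) == GOAL


def train(episodes=2000, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Learn a Q-table with epsilon-greedy exploration (assumed settings)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        state, done = START, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(4)       # explore
            else:
                action = max(range(4), key=lambda a: q[state][a])  # exploit
            nxt, reward, done = step(state, action)
            # Q-Learning target: bootstrap from the best action in the next state.
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q


def greedy_path(q, max_steps=100):
    """Follow the greedy policy from START, stopping at GOAL or after max_steps."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(range(4), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


q_table = train()
path = greedy_path(q_table)
print(len(path) - 1, "steps:", path)
```

Because the update bootstraps from the best next action rather than the action actually taken, the learned greedy policy tends toward the shortest route along the cliff edge, even though the exploring agent falls off that edge from time to time during training.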