SARSA and Q-learning on a Windy Grid World

Primary language: Python. License: MIT.

About the Project

This project implements the SARSA and Q-learning reinforcement learning methods on a Windy Grid World using PyTorch. The picture below shows the state space. The arrows represent the strength of the wind blowing upwards in each column.

[Figure: windygrid]
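
To make the wind mechanics concrete, here is a minimal sketch of a single environment step. It assumes the classic 7x10 layout from Sutton and Barto (the wind strengths, start cell, goal cell, and the reward of -1 per step are all assumptions; the grid in the picture may differ), and the names `WIND`, `ACTIONS`, and `step` are hypothetical, not taken from this repository.

```python
# Assumed layout: the classic 7x10 windy grid from Sutton and Barto.
# The grid used in this repository may differ.
N_ROWS, N_COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]  # upward push per column (assumed)
START, GOAL = (3, 0), (3, 7)           # assumed start and goal cells
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply one move plus the wind of the column the agent starts in.

    Returns (next_state, reward, done). Row 0 is the top row, so wind
    blowing upwards decreases the row index.
    """
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = row + d_row - WIND[col]
    new_col = col + d_col
    # Clip the result so the agent stays inside the grid.
    new_row = min(max(new_row, 0), N_ROWS - 1)
    new_col = min(max(new_col, 0), N_COLS - 1)
    next_state = (new_row, new_col)
    done = next_state == GOAL
    return next_state, -1.0, done
```

Under these assumptions, moving right from `(3, 6)` lands in `(1, 7)` rather than `(3, 7)`, because the wind in column 6 pushes the agent up by two cells.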

Example

SARSA and Q-learning performance experiment:

[Figure: trace_sarsa]

[Figure: trace_qlearning]
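
For reference, the two methods compared here differ only in how the bootstrap target is formed. The following is a minimal tabular sketch in plain Python, not the repository's PyTorch code; the function names, the dictionary-based Q-table, and the default `alpha` and `gamma` values are illustrative assumptions.

```python
def sarsa_update(q, s, a, r, s_next, a_next, done, alpha=0.5, gamma=1.0):
    """On-policy TD update: bootstrap from the action actually taken next."""
    target = r if done else r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


def q_learning_update(q, s, a, r, s_next, done, alpha=0.5, gamma=1.0):
    """Off-policy TD update: bootstrap from the greedy action in s_next."""
    target = r if done else r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


# Toy Q-table: state -> list of action values (hypothetical numbers).
q_sarsa = {"A": [0.0, 0.0], "B": [2.0, 4.0]}
q_qlearn = {"A": [0.0, 0.0], "B": [2.0, 4.0]}

# Same transition for both methods; SARSA's next action is the non-greedy one.
sarsa_update(q_sarsa, "A", 0, -1.0, "B", 0, False)
q_learning_update(q_qlearn, "A", 0, -1.0, "B", False)

print(q_sarsa["A"][0])   # 0.5
print(q_qlearn["A"][0])  # 1.5
```

SARSA uses the value of the action the behavior policy actually selects in the next state, so exploratory moves lower its estimates, while Q-learning always uses the maximum, regardless of what the agent does next.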

Prerequisites

  • pytorch
  • numpy
  • opencv
  • matplotlib