SARSA and Q-learning on a Windy Grid World

About the Project

This project implements the SARSA and Q-learning reinforcement learning methods on a Windy Grid World using PyTorch. The picture below shows the state space. Arrows represent the strength of the wind blowing upwards in each column.
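As a rough illustration of how the two methods differ on this kind of environment, here is a minimal tabular sketch in plain Python. It is not the project's PyTorch code. The grid size, wind strengths, start and goal cells, reward of -1 per step, and all hyperparameters are assumptions taken from the classic textbook version of the problem, and the names `step`, `epsilon_greedy`, and `train` are hypothetical.

```python
import random

# Assumed layout (classic textbook windy grid world, not confirmed by this project).
ROWS, COLS = 7, 10
WIND = [0, 0, 0, 1, 1, 1, 2, 2, 1, 0]  # assumed upward push per column
START, GOAL = (3, 0), (3, 7)           # assumed start and goal cells (row, col)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply the move plus the wind of the column being left, clipped to the grid."""
    row, col = state
    d_row, d_col = ACTIONS[action]
    new_row = min(max(row + d_row - WIND[col], 0), ROWS - 1)
    new_col = min(max(col + d_col, 0), COLS - 1)
    next_state = (new_row, new_col)
    return next_state, -1.0, next_state == GOAL  # assumed reward: -1 per step


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return values.index(max(values))


def train(method, episodes=200, alpha=0.5, gamma=1.0, epsilon=0.1,
          max_steps=10_000, seed=0):
    """Run tabular SARSA (method="sarsa") or Q-learning (any other value)."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
    lengths = []
    for _ in range(episodes):
        state, steps, done = START, 0, False
        action = epsilon_greedy(q, state, epsilon, rng)
        while not done and steps < max_steps:
            next_state, reward, done = step(state, action)
            if method == "sarsa":
                # On-policy: bootstrap from the action that will actually be taken.
                next_action = epsilon_greedy(q, next_state, epsilon, rng)
                bootstrap = q[next_state][next_action]
            else:
                # Off-policy: bootstrap from the greedy value of the next state.
                bootstrap = max(q[next_state])
            target = reward + (0.0 if done else gamma * bootstrap)
            q[state][action] += alpha * (target - q[state][action])
            if method != "sarsa":
                # Q-learning picks its behaviour action after the update.
                next_action = epsilon_greedy(q, next_state, epsilon, rng)
            state, action = next_state, next_action
            steps += 1
        lengths.append(steps)
    return q, lengths


if __name__ == "__main__":
    for name in ("sarsa", "q_learning"):
        _, lengths = train(name)
        print(name, "mean steps over last 20 episodes:", sum(lengths[-20:]) / 20)
```

The only difference between the two branches is the bootstrap term: SARSA uses the value of the action it will actually take next, while Q-learning uses the maximum value in the next state regardless of what the behaviour policy does.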

Example

SARSA and Q-learning performance experiment:

Prerequisites

pytorch
numpy
opencv
matplotlib

TheUnsolvedDev/SARSA-and-Q-learning-on-a-Windy-Grid-World
