SARSA and Q-learning Reinforcement Learning methods on a Windy Grid World using PyTorch. The picture below shows the state space. Arrows represent the strength of the wind flowing upwards in each column.
SARSA and Q-learning performance experiment:
pytorch
numpy
opencv
matplotlib
.