The classic Reinforcement Learning problem solved using a simple Feedforward Neural Network with PyTorch. This was an assignment in the Decision Models course at University of Milano-Bicocca.
The assignment's text is in the file assignment.pdf
.
The algorithm's output is a list of all the successful episodes each with their relative last epoch and state.