Dhruvee Birla and myself.
This assignment was done as a part of the Machine, Data and Learning course, Spring 2021.
The goal of this exercise is to understand Markov Decision Processes using the Value Iteration algorithm.
The problem statements have been mentioned in Assignment.pdf. Task 2 is similar to Task 1, but with the subtasks having a very little variation from the problem in Task 1.
The notebook has the code for the main task (Task 1). The scripts for each task are in the code directory, with each file having a variation of a small number of lines from each other. The trace files generated from these scripts are stored in outputs. The report analyses the algorithm, our approach and the outputs.