/value-iteration

Understanding Markov Decision Processes using Value Iteration algorithm. For MDL course, Spring 2021.

Primary LanguagePython

forthebadge

Contributors

Dhruvee Birla and myself.

This assignment was done as a part of the Machine, Data and Learning course, Spring 2021.


Value Iteration

The goal of this exercise is to understand Markov Decision Processes using the Value Iteration algorithm.

The problem statements have been mentioned in Assignment.pdf. Task 2 is similar to Task 1, but with the subtasks having a very little variation from the problem in Task 1.

The notebook has the code for the main task (Task 1). The scripts for each task are in the code directory, with each file having a variation of a small number of lines from each other. The trace files generated from these scripts are stored in outputs. The report analyses the algorithm, our approach and the outputs.