In this project, I wrote implementations of many classical dynamic programming algorithms.
In this section, I create an instance of the FrozenLake environment.
In this section, I wrote my own implementation of iterative policy evaluation.
In this section, I worte a function that takes the state-value function estimate as input, along with some state . It returns the row in the action-value function corresponding to the input state
. That is, my function accepts as input both
and
, and return
for all
.
In this section, I wrote my own implementation of policy improvement.
In this section, I wrote my own implementation of policy iteration. The algorithm returns the optimal policy, along with its corresponding state-value function.
In this section, I wrote my own implementation of truncated policy iteration.
In this section, I wrote my own implementation of value iteration.