/MDPs-and-Q-learning-On-Ice

Using Markov Decision Processes and Q-Learning on a variation of the Wumpus World problem.

Primary LanguageJupyter Notebook

Stargazers