KTH Reinforcement Learning (EL2805) 2019 coding assignments. As with all my other repos, this is more an exercise for me to understand the algorithms than useful code. I hope it helps you too!
Agent (green) escaping a maze with walls (black) through the exit (blue), while a monster (red) follows a uniform random walk and can cross walls: code
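The monster's behaviour in this first problem can be sketched in a few lines. This is only an illustration, not the code from the repo: the function name `monster_step`, the 4-connected `MOVES` set, and the `(row, col)` grid convention are assumptions made for the example.

```python
import random

# Hypothetical 4-connected move set; the assignment's exact move set may differ.
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def monster_step(pos, n_rows, n_cols, rng=random):
    """Sample the monster's next cell uniformly among its in-bounds neighbours.

    Walls are deliberately not checked, because the monster can cross them.
    Only the outer border of the grid restricts its movement.
    """
    row, col = pos
    candidates = [
        (row + dr, col + dc)
        for dr, dc in MOVES
        if 0 <= row + dr < n_rows and 0 <= col + dc < n_cols
    ]
    return rng.choice(candidates)
```

Passing a seeded `random.Random` instance as `rng` makes a simulated episode reproducible.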
Agent (green) robbing banks (blue) while escaping a police officer (red) who follows a random walk but never moves away from the agent: code
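One possible reading of "never moves away from the agent" is sketched below: the police picks uniformly among the moves that do not point opposite to the agent along either axis. The names `police_step` and `MOVES`, the `(row, col)` convention, and this exact rule are assumptions for illustration, and the repo's code may define the walk differently.

```python
import random

# Hypothetical 4-connected move set; the assignment's exact move set may differ.
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def sign(x):
    """Return -1, 0 or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)


def police_step(police, agent, n_rows, n_cols, rng=random):
    """Random walk that never steps directly away from the agent (assumed rule)."""
    off_r = sign(agent[0] - police[0])  # row direction towards the agent
    off_c = sign(agent[1] - police[1])  # column direction towards the agent
    candidates = []
    for dr, dc in MOVES:
        nxt = (police[0] + dr, police[1] + dc)
        in_bounds = 0 <= nxt[0] < n_rows and 0 <= nxt[1] < n_cols
        # A move goes "away" if it points opposite to the agent on its axis.
        moves_away = (dr != 0 and dr == -off_r) or (dc != 0 and dc == -off_c)
        if in_bounds and not moves_away:
            candidates.append(nxt)
    # Stay in place only if no admissible move exists.
    return rng.choice(candidates) if candidates else police
```

Under this rule, a police officer in the same row as the agent can still move up or down, but never horizontally away from the agent.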
Policy learned by the agent for every police (red) position: code
Agent (green) robbing banks (blue) again while escaping a police officer (red) who follows a random walk: code