SolveBeliefMDP

Build Status

Gifs displaying the deep RL policy's performance on the LaserTag domain

Vanila LaserTag (Discrete Robot State and Action Space with exact belief updates)

variant1

Modified LaserTag (Continuous Robot State and Action Space with exact belief updates)

variant3