kevin-hanselman/grid-world-rl

Value iteration, policy iteration, and Q-Learning in a grid-world MDP.

PythonMIT

Readme
0Issues
24Stargazers
4Watchers

Watchers

eemailme
junkyul
kevin-hanselman
Maine, USA
MannyKayy
Edinburgh Centre for Robotics

Contact site admin: Geeks.