dskart/reinforcement_learning

implementation of a Q-learning agent from scratch with epsilon-greedy actions on different environments (2D grid world and pacman game)

PythonMIT

Watchers