dskart/reinforcement_learning
implementation of a Q-learning agent from scratch with epsilon-greedy actions on different environments (2D grid world and pacman game)
PythonMIT
implementation of a Q-learning agent from scratch with epsilon-greedy actions on different environments (2D grid world and pacman game)
PythonMIT