qishibo/RL-robot-maze

基于强化学习Q-Learning方法实现机器人走迷宫

Python

强化学习之Q-Learning算法实践机器人走迷宫

Python依赖：

numpy
pandas
tkinter

执行过程：

python main.py

其中selected action right为每幕选择的action，R为该幕的总回报值