A DEMO simulation to practice Q-Learning against Minimax with alpha-beta pruning