/tensorflow-rl-tictactoe

Training a TensorFlow neural network to play Tic-Tac-Toe using the one-step Q-learning algorithm.

Primary language: Python
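
The one-step Q-learning update named in the description can be illustrated with a minimal tabular sketch. This is not the repository's code: the project trains a TensorFlow network, whereas this example swaps in a plain dictionary as the Q-function so the update rule is easy to see. The names `ALPHA`, `GAMMA`, `legal_actions`, and `q_update`, the board encoding, and the reward of +1 for a win are all assumptions made for illustration.

```python
from collections import defaultdict

ALPHA = 0.1   # learning rate (assumed value)
GAMMA = 0.9   # discount factor (assumed value)

# Q[(state, action)] -> estimated return. A board is a tuple of 9 cells,
# each holding "X", "O", or " " (hypothetical encoding).
Q = defaultdict(float)


def legal_actions(state):
    """Return the indices of the empty cells on the board."""
    return [i for i, cell in enumerate(state) if cell == " "]


def q_update(state, action, reward, next_state, done):
    """Apply a single one-step Q-learning update and return the new Q-value.

    target = r                               if the episode has ended
    target = r + GAMMA * max_a' Q(s', a')    otherwise
    """
    actions = legal_actions(next_state)
    if done or not actions:
        target = reward
    else:
        target = reward + GAMMA * max(Q[(next_state, a)] for a in actions)
    # Move the current estimate a fraction ALPHA toward the target.
    Q[(state, action)] += ALPHA * (target - Q[(state, action)])
    return Q[(state, action)]


# Example: X completes the top row and receives an assumed reward of +1.
before = ("X", "X", " ", "O", "O", " ", " ", " ", " ")
after = ("X", "X", "X", "O", "O", " ", " ", " ", " ")
value = q_update(before, 2, 1.0, after, done=True)
```

With a neural network in place of the table, the same target would typically serve as the regression label for the network's output at the chosen action. In a two-player game, `next_state` would usually be the board the learner sees after the opponent has replied; that convention is also an assumption here.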

Requirements:

References:

  • Michael L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the Eleventh International Conference on Machine Learning (ICML), pages 157–163, 1994.
  • W. T. Uther and M. Veloso. Adversarial reinforcement learning. School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 1997.
  • R. A. C. Bianchi, C. H. C. Ribeiro, and A. H. R. Costa. Heuristic selection of actions in multiagent reinforcement learning. In IJCAI’07, Hyderabad, India, 2007.