/Multi-Agent-RL-with-TF

Training intrinsically motivated, independent Q-learners to play Tic-Tac-Toe

Primary LanguageJupyter Notebook

Watchers