tansey/linear_ttt
A framework for experimenting with different linear function approximators with gradient-descent Sarsa(lambda) following an epsilon-greedy policy in Tic-Tac-Toe.
Python
No issues in this repository yet.
A framework for experimenting with different linear function approximators with gradient-descent Sarsa(lambda) following an epsilon-greedy policy in Tic-Tac-Toe.
Python
No issues in this repository yet.