ahavenoname/q-learning-delusion
A counterexample for Q-Learning, discussed in "Non-delusional Q-learning and value-iteration."
Jupyter NotebookMIT
No issues in this repository yet.
A counterexample for Q-Learning, discussed in "Non-delusional Q-learning and value-iteration."
Jupyter NotebookMIT
No issues in this repository yet.