/q-learning-delusion

A counterexample for Q-learning, discussed in "Non-delusional Q-learning and value-iteration."

Primary language: Jupyter Notebook. License: MIT.
