/cartpole

Sarsa(λ) and Q-learning on the interrupted and uninterrupted cart-pole

Primary LanguageJupyter NotebookMIT LicenseMIT