/max-bellman-toy

Code for gold mining environment described in the "Maximum Reward Formulation In Reinforcement Learning" paper

Primary LanguageJupyter Notebook

max-bellman-toy

Code for gold mining environment described in the "Maximum Reward Formulation In Reinforcement Learning" paper