/Active-Learning-Reinforcement-Learning

This code can be used to reproduce the results in our paper ``Actively Learning Reinforcement Learning: A Stochastic Optimal Control Approach''.

Primary LanguageJupyter NotebookMIT LicenseMIT

Watchers