msramada/Active-Learning-Reinforcement-Learning

This code can be used to reproduce the results in our paper "Actively Learning Reinforcement Learning: A Stochastic Optimal Control Approach".

Language: Jupyter Notebook. License: MIT.
