/vae_mdp

Implementation of Variational Markov Decision Processes, a framework allowing to (i) distill policies learned through (deep) reinforcement learning and (ii) learn discrete abstractions of continuous environments, the two with bisimulation guarantees.

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

Watchers