florentdelgrange/vae_mdp

Implementation of Variational Markov Decision Processes, a framework allowing to (i) distill policies learned through (deep) reinforcement learning and (ii) learn discrete abstractions of continuous environments, the two with bisimulation guarantees.

Jupyter NotebookGPL-3.0

Watchers

gaperez64
Antwerp, Belgium
florentdelgrange