/la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Primary LanguagePythonMIT LicenseMIT

Watchers