LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization.
Primary language: Python
License: MIT