/POWERS

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

Watchers