uclaml/POWERS

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

Jupyter NotebookApache-2.0

Readme
0Issues
0Stargazers
2Watchers

Watchers

eemailme
uclaml
Department of Computer Science, UCLA

Contact site admin: Geeks.