/PGRD

Implementing the "Reward Design via Online Gradient Ascent" paper

Primary LanguageJupyter Notebook

Stargazers