/SRPO

Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).

Primary language: Python. License: MIT.
