/d3po

Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Primary LanguagePythonMIT LicenseMIT

No issues in this repository yet.