/d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Primary LanguagePythonMIT LicenseMIT

Watchers