qqingzheng/AI-Self-Training-DPO-SDXL
Unofficial implementation of a Stable Diffusion model trained with AI feedback-based self-training Direct Preference Optimization (DPO).
Python
Issues
- Question on fine-tuning Stable Diffusion (#5 opened by TomLucidor, 0 comments)
- Poor dpo_beta default? (#4 opened by feffy380, 0 comments)
- data question (#3 opened by unwritten, 2 comments)
- learning rate adjustment (#1 opened by 1073521013, 8 comments)