TZW1998/Taming-Stable-Diffusion-with-Human-Ranking-Feedback

This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et al. https://arxiv.org/abs/2303.03751

Jupyter NotebookMIT