/ddpo-pytorch

Reproduction of DDPO paper (RLHF for diffusion)

Primary LanguageJupyter Notebook

Watchers