/S-DPO

[NeurIPS 2024] The implementation of paper "On Softmax Direct Preference Optimization for Recommendation"

Primary LanguagePython

Stargazers