preference-learning-with-rationales

This is the public repository for Data-Centric Human Preference Optimization with Rationales.

Primary language: Python
License: Apache License 2.0 (Apache-2.0)
