Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
Primary LanguagePythonMIT LicenseMIT
No issues in this repository yet.