Demo training Alpaca Farm dataset with trl DPO
Primary LanguagePython
No issues in this repository yet.