/dpo

Scripts for fine-tuning Llama2 via SFT and DPO.

Primary LanguagePython

Stargazers

No one’s star this repository yet.