/dpo

Scripts for fine-tuning Llama2 via SFT and DPO.

Primary LanguagePython

Watchers

No one’s watching this repository yet.