/LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Primary LanguagePython

Watchers

No one’s watching this repository yet.