/LlamaRLHFHub

A hub that integrates a variety of effective algorithms to optimize the "Llama-2" model, focusing on the RLHF of the "Llama-2" model.

Primary LanguagePython

LlamaRLHFHub

A hub that integrates a variety of effective algorithms to optimize the "Llama-2" model, focusing on the RLHF of the "Llama-2" model.