/ReaLHF

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers