/lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Primary LanguagePythonMIT LicenseMIT

Watchers