/pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

Primary LanguagePythonMIT LicenseMIT

Stargazers