/lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Primary LanguagePythonMIT LicenseMIT

Watchers