Code for the paper Fine-Tuning Language Models from Human Preferences
Primary LanguagePythonMIT LicenseMIT