/learning-from-human-feedback

Reimplementation of OpenAI's "Learning to summarize from human feedback"

MIT LicenseMIT

Watchers