Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI
Primary language: Python. License: MIT.