distilled Self-Critique refines the outputs of a LLM with only synthetic data
Primary LanguageJupyter NotebookMIT LicenseMIT