Code and data for paper "Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation" accepted by ACL 2024.
Primary LanguagePythonApache License 2.0Apache-2.0