/DLMA

Code and data for paper "Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation" accepted by ACL 2024.

Primary LanguagePythonApache License 2.0Apache-2.0

Stargazers