exlaw/DLMA

Code and data for paper "Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation" accepted by ACL 2024.

PythonApache-2.0

Stargazers