/Direct-Preference-Optimization

Direct Preference Optimization from scratch in PyTorch

Primary Language: Python
