ro-ko/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Python · Apache-2.0
No issues in this repository yet.