Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
Primary LanguagePythonApache License 2.0Apache-2.0