OptimalScale/LMFlow

[DPO is available?]

chowkamlee81 opened this issue · 2 comments

Is your feature request related to a problem? Please describe.
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like
A clear and concise description of what you want to happen.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Additional context
Add any other context or screenshots about the feature request here.

Thanks for your interest in LMFlow! Also thanks for the suggestion, we are considering providing a list a scripts in LMFlow, or another repository under OptimalScale. We will let you know once it is available 😄

Thanks for your valuable suggestions! We have provided a simple example script to support DPO (https://github.com/OptimalScale/LMFlow/blob/main/scripts/run_dpo_align.sh). You may modify corresponding arguments in the script for your own needs. If you encounter further issues, please feel free to let us know. Your feedbacks mean a lot to us 😄