Closed this issue 7 months ago · 0 comments
[Current] 97. ORPO: Monolithic Preference Optimization without Reference Model 98/ Do Large Language Models Understand Logic or Just Mimick Context?
[Proposed] 97. ORPO: Monolithic Preference Optimization without Reference Model 98. Do Large Language Models Understand Logic or Just Mimick Context?