Issues
compute_logps function: why does it also return the prob for the last token of the answer?
#35 opened by javismiles - 2
Could you please tell me which OpenAI API you used during the MT-Bench evaluation?
#34 opened by hitszxs - 1
Can you please give us a guideline for using your method to train with LLaMA Factory?
#20 opened by TonyQJH - 2
[Question] How does ORPO combine DPO into SFT?
#33 opened by wj-Mcat - 16
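For context on the question above: unlike DPO, ORPO needs no reference model — it adds a log-odds-ratio penalty directly to the SFT loss. A minimal sketch of that objective, assuming mean per-token log-probs as inputs (a simplification of the paper's formulation, not the repository's exact code):

```python
import math

def orpo_loss(logp_chosen, logp_rejected, nll_chosen, lam=0.1):
    """ORPO objective sketch: SFT NLL on the chosen response plus a
    log-odds-ratio penalty (no frozen reference model, unlike DPO).

    logp_* are mean per-token log-probs of each response under the
    policy; nll_chosen is the usual SFT loss; `lam` weights the
    penalty (lambda in the ORPO paper).
    """
    def log_odds(logp):
        # log(p / (1 - p)) computed directly from log p
        return logp - math.log(1.0 - math.exp(logp))

    ratio = log_odds(logp_chosen) - log_odds(logp_rejected)
    # -log sigmoid(ratio): small when chosen is favored over rejected
    l_or = -math.log(1.0 / (1.0 + math.exp(-ratio)))
    return nll_chosen + lam * l_or
```

When the chosen response is more likely than the rejected one, the penalty term is near zero and the loss reduces to roughly the SFT loss.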
Loss device for ORPOTrainer
#18 opened by ganeshkrishnan1 - 2
Discarding the prompt tokens only with the positive labels and not with the negative ones
#32 opened by javismiles - 2
[Question] ORPO Fine-tuning Data Format
#30 opened by nooobodynose - 4
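For the data-format question above: preference-style trainers such as TRL's ORPOTrainer expect records with prompt, chosen, and rejected fields. An illustrative record (contents invented; check the trainer docs for the exact schema your version expects):

```python
# Illustrative preference record for ORPO-style training.
# Field names follow TRL's prompt/chosen/rejected convention.
example = {
    "prompt": "What is the capital of France?",
    "chosen": "The capital of France is Paris.",
    "rejected": "France does not have a capital.",
}
```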
Unexpected results using ORPO trl
#28 opened by celsowm - 3
Poor performance on llama3
#27 opened by JasonZhu1313 - 1
no reference model?
#23 opened by kxleee - 2
Memory Consumption
#22 opened by paulcx - 1
how to do ORPO with ShareGPT data?
#11 opened by pabl-o-ce - 3
attention mask in compute_logps function
#17 opened by hjc3613 - 3
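Several issues above (#35, #32, #17) concern the compute_logps function. A minimal sketch of per-token log-probability extraction for a causal LM, assuming plain Python lists rather than the repository's tensors: position t's logits predict token t+1, so the final position scores a token *after* the answer and should be dropped — which is likely what issue #35 is asking about.

```python
import math

def compute_logps(logits, labels, mask):
    """Sum of log-probs of `labels` under `logits` for a causal LM.

    logits: per-position score lists, shape (seq_len, vocab).
    Position t predicts token t+1, so logits[:-1] pair with
    labels[1:]; `mask` zeroes out prompt/padding positions.
    """
    total = 0.0
    for scores, label, m in zip(logits[:-1], labels[1:], mask[1:]):
        # numerically stable log-softmax for the chosen label
        mx = max(scores)
        log_z = mx + math.log(sum(math.exp(s - mx) for s in scores))
        total += m * (scores[label] - log_z)
    return total
```

With uniform logits over a vocabulary of size V, each unmasked token contributes -log(V), which makes the shift-by-one behavior easy to verify.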
how to install requirements.txt on colab?
#12 opened by srn-source - 1
[Question] Memory requirements for ORPOTrainer
#13 opened by snassimr - 1
[Question] ORPO + SFTTrainer + QLora
#10 opened by snassimr - 3
prompt formatting issue
#5 opened by RonanKMcGovern - 11
4xA6000 training - failed to save model
#3 opened by rkinas