dvlab-research/Step-DPO

question about Data Construction Pipeline

yyht opened this issue · 0 comments

yyht commented

the released data construction pipline use qwen2-instruct to generate the response, for different base model, do we need different instruct model to construct response of sft-dataset