YifeiZhou02/ArCHer

In this work, was an SFT model trained beforehand for the WebShop training?

xiaxiaxiatengxi opened this issue · 2 comments

In this work, was an SFT model trained beforehand for the WebShop training?

Yes, it is available at the Google Drive link provided in the README.

DZ9 commented

Is gpt2_bc_workshop_history.pt the SFT model for WebShop? Does that mean I need to load this model when training ArCHer in the RL step? How can I train a new SFT model with a different base LLM, such as Llama 2? Thanks.