Questions about Critic model

Question

Questions about Critic model

leejaehoon1830 opened this issue 10 months ago · 2 comments

I am curious whether you only used llama2 13b to create data when generating your generation model's training data, or if you also used llama2 7b to generate generation model's training data.

Answer 1 · 2024-03-19T14:40:59.000Z

Hi, If I'm correct, the authors used GPT-4 model to get the training data, check paper for more specific details.

Answer 2 · 2024-03-19T15:51:41.000Z

I know the dataset used for training the critic model in GPT-4, but I want to know about the dataset used for the generation model.