is there any info constitutional_evaluation function??

Question

yeongsang2 opened this issue a year ago · 2 comments

3.3 Training Loop

Answer 1 · 2023-08-24T05:04:48.000Z

I guess evaluate_response_for_RL() could be used as a workaround. This function tries to use OpenAI's text-davinci-003 to return a reward number.

Answer 2 · 2023-08-24T05:06:00.000Z

I guess evaluate_response_for_RL() could be used as a workaround. This function tries to use OpenAI's text-davinci-003 to return a reward number.

I think so too, thanks for the reply.