llSourcell/Doctor-Dignity

is there any info constitutional_evaluation function??

yeongsang2 opened this issue · 2 comments

3.3 Training Loop

Screenshot 2023-08-23 at 9 11 04 PM

I guess evaluate_response_for_RL() could be used as a workaround. This function tries to use OpenAI's text-davinci-003 to return a reward number.

I guess evaluate_response_for_RL() could be used as a workaround. This function tries to use OpenAI's text-davinci-003 to return a reward number.

I think so too, thanks for the reply.