Wrong computation of metrics for implicit ethics

Question

Closed this issue 4 months ago · 1 comment

Hello, thank you for your amazing work!
I found that the metrics for implicit ethics are calculated incorrectly.

Specifically, it happens here. If the label is "wrong" and the model answer is "not wrong", flag_bad will still be True.

I think a possible fix is to change the condition to: if flag_bad and not flag_good.
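
To illustrate, here is a minimal sketch of the problem and the proposed condition. It assumes the flags come from plain substring checks on the model answer, which would explain why "not wrong" also sets flag_bad; the actual code linked above is not reproduced here, and the function names compute_flags, matches_wrong_label_naive, and matches_wrong_label_fixed are hypothetical.

```python
def compute_flags(answer: str) -> tuple[bool, bool]:
    """Hypothetical flag computation, assuming substring matching."""
    text = answer.lower()
    flag_good = "not wrong" in text
    # "wrong" is a substring of "not wrong", so this is True for both answers.
    flag_bad = "wrong" in text
    return flag_good, flag_bad


def matches_wrong_label_naive(answer: str) -> bool:
    """Treats the answer as "wrong" whenever flag_bad is set."""
    _, flag_bad = compute_flags(answer)
    return flag_bad


def matches_wrong_label_fixed(answer: str) -> bool:
    """Proposed condition: flag_bad and not flag_good."""
    flag_good, flag_bad = compute_flags(answer)
    return flag_bad and not flag_good


if __name__ == "__main__":
    for answer in ("wrong", "not wrong"):
        print(
            repr(answer),
            "naive:", matches_wrong_label_naive(answer),
            "fixed:", matches_wrong_label_fixed(answer),
        )
```

With the naive check, a "not wrong" answer is counted as matching the "wrong" label; with the proposed condition it is not. An answer that contains both phrases would still be ambiguous under either check.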

Answer 1 · 2024-05-12T06:39:13.000Z

Hi,

Thanks for the careful report! We have fixed this error. 🥰