Adding a scorer without rerunning existing scorers.

Question

Opened this issue a month ago · 2 comments

I sometimes want to score a task with an additional scorer but not have to rerun the scorers that have already been run. Is there a way to do that?

Answer 1 · 2024-10-27T20:57:12.000Z

There is actually a score() function you can use for this (I don't think it's properly documented on the website yet; we will remedy this!): https://github.com/UKGovernmentBEIS/inspect_ai/blob/main/src/inspect_ai/_eval/score.py

Answer 2 · 2024-10-28T12:17:06.000Z

Doesn't the score function simply return a new EvalLog containing only the newly specified scorers? Could it perhaps accept a flag that, when set, appends the new scorer and its results to the existing EvalLog instead?

Also, the score function can only process a single EvalLog at a time. This is quite minor, but it would be nice if it could score multiple EvalLogs at once.
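
Until something like that exists, both requests can be approximated in user code. The following is only a rough sketch, not part of inspect_ai: `merge_scores` and `score_appending` are hypothetical helper names, and the code assumes that each log exposes a `samples` list whose items carry `id`, `epoch`, and a `scores` dict keyed by scorer name. It deliberately takes the scoring function as a parameter (`score_fn`) so that it makes no claims about the exact signature of score().

```python
from copy import deepcopy
from typing import Any, Callable, Iterable, List


def merge_scores(original: Any, rescored: Any) -> Any:
    """Return a copy of `original` whose samples also carry the scores in `rescored`.

    Assumption: both objects have `.samples`, and every sample has
    `.id`, `.epoch`, and `.scores` (a dict keyed by scorer name, or None).
    """
    merged = deepcopy(original)

    # Index the newly scored samples by (id, epoch) so sample order does not matter.
    new_scores = {
        (s.id, s.epoch): (s.scores or {}) for s in (rescored.samples or [])
    }

    for sample in merged.samples or []:
        combined = dict(sample.scores or {})
        # Existing scorers are kept; a new scorer with the same name overwrites.
        combined.update(new_scores.get((sample.id, sample.epoch), {}))
        sample.scores = combined

    return merged


def score_appending(
    logs: Iterable[Any], score_fn: Callable[[Any], Any]
) -> List[Any]:
    """Score several logs one after another, appending to each log's scores.

    `score_fn` is whatever callable produces a rescored log from a log,
    for example a small wrapper around score() with the new scorer bound.
    """
    return [merge_scores(log, score_fn(log)) for log in logs]
```

With inspect_ai, `score_fn` would presumably be something like `lambda log: score(log, my_scorer)`. Note that this sketch only merges per-sample scores; any aggregate metrics stored on the log are left as they were in the original and would need to be handled separately.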

Thanks for the quick response btw!