[Feature Request]: Evaluation Harness

Question

[Feature Request]: Evaluation Harness

Closed this issue a month ago · 0 comments

Feature Description

Our agents and prompting methods can return the answer, but there still needs to be a harness to parse this outputted answer and generate a metric value.

Reason

No response