[Feature Request]: Evaluation Harness
Closed this issue · 0 comments
alckasoc commented
Feature Description
Our agents and prompting methods can return the answer, but there still needs to be a harness to parse this outputted answer and generate a metric value.
Reason
No response