agential-ai/agential

[Feature Request]: Evaluation Metrics

Opened this issue · 0 comments

Feature Description

Evaluation metrics like f1, precision, recall, EM, fuzzy match?, pass@k and any other ones relevant to our currently supported benchmarks

Reason

No response