Log model responses directly to file and reuse them for debugging

Question

bauersimon opened this issue 6 months ago · 1 comments

Goal, be able to use exactly 1:1 responses from a previous run to debug the evaluation logic.

log model responses directly to files (either on provider query response level or generate test level)
add dummy model that takes these files and responds accordingly (essentially mimicking/replaying the original model responses)

Answer 1 · 2024-07-04T08:02:56.000Z

Duplicate of #204. Closing.