holarissun/BenchmarkPromptsWithResponses

Every prompt engineering paper should provide not only on-average performance of the prompting strategy, but should also release the responses to facilitate future research and avoid repeatedly calling the LLMs for the same queries+prompts.

BenchmarkPromptsWithResponses