How do you get the evaluation results of the `prompt enginering` leaderboard?

Question

How do you get the evaluation results of the `prompt enginering` leaderboard?

zhimin-z opened this issue 9 months ago · 3 comments

Check https://promptbench.readthedocs.io/en/latest/leaderboard/pe.html:

I failed to locate any related report or paper that could reproduce these evaluation results...
@madhavMathur @jindongwang @msftgits @dnfclas

Answer 1 · 2024-02-20T22:58:29.000Z

In the latest version of the paper, only Figure 4 is directly related to the leaderboard above:

However, this figure is just a subset of these evaluation results and there are no numeric values shown on the top of the chart bars. Would you mind adding more explanation to the benchmark documentation?

Answer 2 · 2024-03-19T08:32:42.000Z

@icecream-and-tea can answer this question

Answer 3 · 2024-03-19T09:21:55.000Z

For the sake of convenience in presentation, we only included the main part of the prompt engineering evaluation results in the paper. The evaluation of all data can be found in this page ( https://promptbench.readthedocs.io/en/latest/leaderboard/pe.html)

Answer 4 · 2024-03-19T15:51:09.000Z

For the sake of convenience in presentation, we only included the main part of the prompt engineering evaluation results in the paper. The evaluation of all data can be found in this page ( https://promptbench.readthedocs.io/en/latest/leaderboard/pe.html)

Add a PR accordingly: #58