ntunlp/xCodeEval

Retrieval Tasks Evaluation

Closed this issue · 3 comments

I have noticed that only the input data are released in the test set of the retrieval tasks.
What steps should I take to evaluate my own model on this task without the corresponding answers?
May I use the ExecEval engine to check whether the retrieved candidates pass the unit tests?
I would really appreciate it if you could answer this.

No need to use ExecEval. We will release the gold labels for the retrieval corpus. Our lead author for the retrieval task was not available during the final submission, so to avoid introducing errors we decided not to release the retrieval gold data at that time. We will release the data shortly. In the meantime, if you need it urgently, please reach out via email.
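Once the gold labels are available, a standard way to score a retrieval model is recall@k over the ranked candidate lists. This is a minimal sketch, assuming a layout of query id → ranked candidate ids and query id → set of gold ids; it is not the official xCodeEval format or evaluation script.

```python
def recall_at_k(ranked, gold, k):
    """Fraction of queries whose top-k candidates contain a gold-labeled item.

    ranked: dict mapping query id -> list of candidate ids, best first
    gold:   dict mapping query id -> set of relevant candidate ids
    """
    if not ranked:
        return 0.0
    hits = 0
    for qid, candidates in ranked.items():
        relevant = gold.get(qid, set())
        if relevant & set(candidates[:k]):  # any gold item in the top k?
            hits += 1
    return hits / len(ranked)

# Toy example with made-up ids: q1 hits at rank 2, q2 misses entirely.
ranked = {"q1": ["c3", "c1", "c9"], "q2": ["c4", "c7", "c2"]}
gold = {"q1": {"c1"}, "q2": {"c5"}}
print(recall_at_k(ranked, gold, 2))  # 0.5
```

The same loop extends to other cutoff-based metrics (e.g. MRR) by replacing the top-k membership check with the rank of the first relevant candidate.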

Thanks for your kind response.
Looking forward to your further updates.
Good luck.

Hello, have the gold labels for the nl_code retrieval dataset been released?