Some results issue

Question

viyjy opened this issue 3 years ago · 4 comments

Hi Junnan, I have the following questions about the result. Hope that you can help to clarify them, thanks.

VQA:
I get the result folder after fine-tuning on the VQA dataset. Which json file should I use to get the test-dev and test-std results?
SNLI-VE:
This is the log file after fine-tuning on the SNLI-VE dataset. You didn't update the best-epoch, so it is always 0. Should I pick the row which has the best val accuracy as the final result?
Grounding:
This is the log file after fine-tuning on Ref-COCO. Should I pick the row which has the best val_d as the final result?
NLVR2:
This is the log file after fine-tuning on NLVR2, but I didn't find dev and test-P as shown in your paper. Any idea?

Answer 1 · 2021-10-26T04:13:50.000Z

Hi, thanks for your interest. Here are my answers.

VQA: vqa_result_epoch7.json is the final result which collects results from all ranks.
SNLI-VE: thanks for spotting my mistake. Yes, you should use the row with the best val_acc.
Grounding: if I remember correctly, in the paper I just reported the last epoch's result.
NLVR: dev is val, test-P is test.
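
For anyone who wants to pick the row automatically, here is a minimal sketch of the selection rules above. It assumes the log file holds one JSON object per line and that each row carries keys such as `epoch`, `val_acc`, and `test_acc`; those key names, the helper names `parse_rows`, `best_row`, and `last_row`, and the sample values are all illustrative assumptions, not actual results or the repository's own code.

```python
import json


def parse_rows(log_lines, key):
    """Parse JSON-lines log text and keep only rows that contain `key`."""
    rows = []
    for line in log_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        row = json.loads(line)
        if key in row:
            rows.append(row)
    if not rows:
        raise ValueError(f"no log row contains the key {key!r}")
    return rows


def best_row(log_lines, key="val_acc"):
    """Return the row with the highest `key` (e.g. for SNLI-VE).

    Values are passed through float() because metrics may be logged
    as formatted strings rather than numbers.
    """
    return max(parse_rows(log_lines, key), key=lambda r: float(r[key]))


def last_row(log_lines, key="epoch"):
    """Return the final logged row (e.g. when reporting the last epoch)."""
    return parse_rows(log_lines, key)[-1]


# Made-up sample rows, for illustration only.
sample_log = [
    '{"epoch": 0, "val_acc": "0.7800", "test_acc": "0.7790"}',
    '{"epoch": 1, "val_acc": "0.8010", "test_acc": "0.7985"}',
    '{"epoch": 2, "val_acc": "0.7950", "test_acc": "0.8002"}',
]

best = best_row(sample_log)
final = last_row(sample_log)
print("best val row:", best["epoch"], best["test_acc"])
print("last row:", final["epoch"], final["test_acc"])
```

With a real log, the same functions can be fed `open(path).readlines()`, where `path` points at the log file written during fine-tuning. Selecting on the validation metric and then reading the test metric from that same row keeps the test set out of the model selection.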

Answer 2 · 2021-10-26T04:22:57.000Z

Thanks very much!

Answer 3 · 2022-01-18T07:56:23.000Z

Hello! Can NLVR2 datasets be shared?

Answer 4 · 2022-01-18T15:51:45.000Z