Bad performance of NLVR2.

Question

yangxuntu opened this issue 4 years ago · 18 comments

Hi, I ran into the same problem as in https://github.com/airsplay/lxmert/issues/1, and my accuracy also stays at about 50:
Epoch 0: Train 50.31
Epoch 0: Valid 50.86
Epoch 0: Best 50.86

Epoch 1: Train 50.39
Epoch 1: Valid 49.14
Epoch 1: Best 50.86

Epoch 2: Train 50.44
Epoch 2: Valid 49.14
Epoch 2: Best 50.86

Epoch 3: Train 50.57
Epoch 3: Valid 50.86
Epoch 3: Best 50.86
I also tried torch == 1.0.1, but it still did not work. I also wanted to download the data from that link, but the link no longer seems to exist. Could you upload these features again? Thank you very much!

Answer 1 · 2020-10-08T15:50:33.000Z

Which feature do you mean by the "link"? The default feature link, `wget --no-check-certificate https://nlp1.cs.unc.edu/data/lxmert_data/nlvr2_imgfeat/train_obj36.zip -P data/nlvr2_imgfeat`, works well on my side.

Answer 2 · 2020-10-08T15:59:22.000Z

The link for the raw features, which you provided in https://github.com/airsplay/lxmert/issues/1.
I am also downloading the features from Google Drive; I hope they will work.
But it is really weird that I get such strange results. Should I use exactly the same environment as yours? Do you have an Anaconda environment you could share?

Answer 3 · 2020-10-08T20:46:43.000Z

In my experience, the most likely reason is that the pre-trained model is not loaded correctly.

Could you check whether you have downloaded the pre-trained model and placed it in the correct location?
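
A minimal sketch of such a check is shown here. The path `snap/pretrained/model_LXRT.pth` is an assumption used for illustration; substitute whatever location your own run script passes to the loader.

```python
import os


def checkpoint_status(path):
    """Describe whether a checkpoint file exists and is non-empty."""
    if not os.path.isfile(path):
        return "missing"
    if os.path.getsize(path) == 0:
        # A zero-byte file usually means an interrupted download.
        return "empty"
    return "ok"


if __name__ == "__main__":
    # Hypothetical location; adjust it to your own setup.
    print(checkpoint_status("snap/pretrained/model_LXRT.pth"))
```

A result of "ok" only shows that a file is present; it does not prove that its parameters were actually copied into the model.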

Answer 4 · 2020-10-09T01:02:25.000Z

Ok. I will check it. Thank you very much!

Answer 5 · 2020-10-13T01:24:44.000Z

It was my mistake: I did not correctly load all the parameters from the pre-trained model. The original code is correct.
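
For anyone debugging the same symptom, one way to see whether all parameters were read is to compare the parameter names the model expects with the names stored in the checkpoint. The sketch below is an illustration, not the code used in this thread; `compare_keys` and the example key names are hypothetical.

```python
def compare_keys(model_keys, checkpoint_keys):
    """Return (missing, unexpected) parameter names as sorted lists.

    missing:    names the model expects but the checkpoint lacks
    unexpected: names in the checkpoint that the model does not have
    """
    model_keys = set(model_keys)
    checkpoint_keys = set(checkpoint_keys)
    missing = sorted(model_keys - checkpoint_keys)
    unexpected = sorted(checkpoint_keys - model_keys)
    return missing, unexpected


if __name__ == "__main__":
    # Hypothetical names, used only to show the output shape.
    model = ["encoder.weight", "encoder.bias", "classifier.weight"]
    checkpoint = ["encoder.weight", "encoder.bias", "pooler.weight"]
    missing, unexpected = compare_keys(model, checkpoint)
    print("missing:", missing)
    print("unexpected:", unexpected)
```

With PyTorch, the two arguments would typically be `model.state_dict().keys()` and the keys of the dictionary returned by `torch.load`. If either list is non-empty, some weights keep their random initialization.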

Answer 6 · 2021-03-18T09:03:00.000Z

I have run into this problem too. Could you tell me in detail how you solved it? Thank you!

Answer 7 · 2021-03-21T09:21:55.000Z

Hi, sorry to disturb you. I have a question about running your code on NLVR. When I train on NLVR without the pre-trained model, the result is:
Epoch 0: Train 50.31
Epoch 0: Valid 50.86
Epoch 0: Best 50.86

Epoch 1: Train 50.39
Epoch 1: Valid 49.14
Epoch 1: Best 50.86

Epoch 2: Train 50.44
Epoch 2: Valid 49.14
Epoch 2: Best 50.86

Epoch 3: Train 50.57
Epoch 3: Valid 50.86
Epoch 3: Best 50.86

So, how can I train NLVR without the pre-trained model? Could you please reply soon? I'm in a hurry!

Answer 8 · 2021-03-21T11:53:52.000Z

In my case, I had modified some parts of the original lxmert code. I then found that the model could only reach 50 accuracy because the load function in nlvr2.py ignores the `.module` part of the parameter keys.
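
The sketch below shows one common way to handle this kind of key mismatch, assuming the issue is the `module.` prefix that `torch.nn.DataParallel` adds to parameter names when a wrapped model is saved. It is not the poster's actual fix, which is not shown in the thread; `strip_module_prefix` is a hypothetical helper.

```python
def strip_module_prefix(state_dict, prefix="module."):
    """Return a copy of state_dict with a leading prefix removed from each key.

    Keys that do not start with the prefix are kept unchanged.
    """
    cleaned = {}
    for key, value in state_dict.items():
        if key.startswith(prefix):
            key = key[len(prefix):]
        cleaned[key] = value
    return cleaned


if __name__ == "__main__":
    # Placeholder values stand in for tensors.
    saved = {"module.encoder.weight": 1, "module.encoder.bias": 2, "head.weight": 3}
    print(sorted(strip_module_prefix(saved)))
```

The cleaned dictionary can then be passed to the model's `load_state_dict`. The reverse case, where the model is wrapped but the checkpoint keys are not prefixed, needs the prefix added instead.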

Answer 9 · 2021-03-21T11:54:38.000Z

This is my code, but I do not know whether it is suitable to your case.

…

Answer 10 · 2021-03-21T12:02:01.000Z

In my case, I didn't change any code; I just didn't load the pre-trained model, because I wanted to train the entire model from scratch.

Answer 11 · 2021-03-21T12:05:26.000Z

When I load the pre-trained model, I get normal results. Is this because of some initialization problem? I'm in a hurry!

Answer 12 · 2021-03-21T12:06:48.000Z

Without loading the pre-trained model, it is normal to get about 50 accuracy.

Answer 13 · 2021-03-21T12:10:11.000Z

Can't I train from scratch without loading a pre-trained model?

Answer 14 · 2021-03-21T12:11:48.000Z

You need to load the pre-trained model, or you cannot get 74 accuracy.

Answer 15 · 2021-03-21T12:14:10.000Z

Thanks for your reply. But why does it stay at 50.86? That's weird.

Answer 16 · 2021-03-21T12:15:49.000Z

It's like an untrained guess
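
One possible reading, not confirmed by anyone in the thread: NLVR2 is a binary true/false task, and the two validation values that appear, 50.86 and 49.14, sum to 100. That is the pattern a classifier produces when it predicts a single label for every example. The labels below are synthetic, chosen only to reproduce that split.

```python
def accuracy(predictions, labels):
    """Percentage of predictions that match the labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Synthetic binary labels: 50.86% are 1 and 49.14% are 0.
labels = [1] * 5086 + [0] * 4914

always_one = [1] * len(labels)
always_zero = [0] * len(labels)

if __name__ == "__main__":
    print(round(accuracy(always_one, labels), 2))
    print(round(accuracy(always_zero, labels), 2))
```

Under this reading, the validation score flips between the two values depending on which label the model happens to output in a given epoch, and it never leaves that pair until the model starts to separate the classes.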

Answer 17 · 2021-03-21T12:17:16.000Z

And when I changed the model to my own, the result was 50.86 every time.

Answer 18 · 2021-03-21T12:31:22.000Z

Strangely enough, when I trained on VQA without a pre-trained model, the model could be trained and its accuracy improved.