Implementation of the fusion operation in MGCN

Question

Opened this issue 5 years ago · 9 comments

Thanks for your great work!
I couldn't find the code that implements the fusion operation in MGCN (Eq. 10 in your CVPR paper). I think it is the key to using relationship information correctly.
Could you please clarify this for me? Thanks!

Answer 1 · 2019-12-18T07:33:29.000Z

This part does not improve performance much, so I removed it in the updated version for faster training.

Answer 2 · 2020-04-09T08:37:32.000Z

I am preparing to reproduce the SGAE project, but because of my computer's configuration I cannot fully generate cocobu_att, cocobu_box, and cocobu_fc from the original author's TSV dataset. I found your issue under the author's project and hope you can help. Could you send the three folders you generated to my mailbox, 997932544@qq.com? Thank you very much and good luck.

Answer 3 · 2020-09-05T13:18:54.000Z

@yangxuntu What do you mean by deleting this part? During training, use_rela is set to 0, so how does MGCN work? Also, in your paper, Urij and Uai are obtained in a way similar to formula (10); how do you get Vrij and Vai? Are the ROI features of relationships available in the pre-processed files? Thank you.

Answer 4 · 2020-09-06T08:40:39.000Z

If you set use_rela=1, you will get higher performance, but training will take many more hours. I uploaded an updated version of the code without use_rela, and its performance is still good enough. If use_rela=1, v_rij and v_ai are also computed by the graph embedding operation.
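
For readers following along, here is a minimal sketch of what a graph embedding step for relationships could look like. It is not the author's code: it assumes that each relationship embedding is a fully connected layer with ReLU applied to the concatenated subject, relation, and object features, and the names `linear_relu` and `embed_relationships` are hypothetical. The pass-through branch for `use_rela=0` is also an assumption made only for illustration.

```python
def linear_relu(x, weight, bias):
    """Fully connected layer followed by ReLU: max(0, W x + b)."""
    return [max(0.0, sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weight, bias)]


def embed_relationships(obj_feats, rela_feats, pairs, weight, bias, use_rela=1):
    """Return one embedding per relationship.

    obj_feats:  list of object feature vectors
    rela_feats: list of relation feature vectors, aligned with `pairs`
    pairs:      list of (subject_index, object_index) tuples
    """
    if not use_rela:
        # Assumption for illustration: without use_rela the relation
        # features are passed through unchanged.
        return [list(r) for r in rela_feats]
    embedded = []
    for (s, o), r in zip(pairs, rela_feats):
        # Concatenate subject, relation, and object features as context.
        context = list(obj_feats[s]) + list(r) + list(obj_feats[o])
        embedded.append(linear_relu(context, weight, bias))
    return embedded


# Toy usage with 2-dimensional features (real ROI features are 2048-d).
objects = [[1.0, 2.0], [3.0, 4.0]]
relations = [[5.0, 6.0]]
pairs = [(0, 1)]
weight = [[1.0] * 6, [-1.0] * 6]
bias = [0.0, 0.0]
result = embed_relationships(objects, relations, pairs, weight, bias)
```

The extra per-relationship layer is one plausible reason training takes longer with use_rela=1, since the number of relationships can greatly exceed the number of objects.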

Answer 5 · 2020-09-06T08:54:19.000Z

@yangxuntu Thank you, but I still do not quite understand. According to https://shiyaya.github.io/2019/03/16/SAGE-Auto-Encoding-Scene-Graphs-for-Image-Captioning/, the input to the image encoder includes v_r, the relation ROI feature, which is also 2048-dimensional. As far as I know, the object ROI feature is obtained through bottom-up attention. What about the relation ROI feature? Is it obtained by combining the boxes of the subject and object involved in each relationship, and then using bottom-up attention to extract the corresponding ROI feature? There seems to be no relation ROI feature in the pre-processed files. Should I check again?
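
To make the "combining the boxes" idea in this question concrete, here is a minimal sketch of computing a union box for each subject/object pair. It assumes boxes are stored as `(x1, y1, x2, y2)` tuples, and the helper names `union_box` and `relation_boxes` are hypothetical. It only illustrates the hypothesis raised in the question; it does not show how the features used in the paper were actually extracted.

```python
def union_box(subject_box, object_box):
    """Smallest box enclosing both inputs; boxes are (x1, y1, x2, y2)."""
    sx1, sy1, sx2, sy2 = subject_box
    ox1, oy1, ox2, oy2 = object_box
    return (min(sx1, ox1), min(sy1, oy1), max(sx2, ox2), max(sy2, oy2))


def relation_boxes(boxes, pairs):
    """Union box for every (subject_index, object_index) pair."""
    return [union_box(boxes[s], boxes[o]) for s, o in pairs]


# Toy usage: two object boxes and one relationship between them.
boxes = [(10, 20, 50, 60), (30, 10, 80, 40)]
rel_boxes = relation_boxes(boxes, [(0, 1)])
```

Each resulting box could then be passed to an ROI feature extractor in the same way as an object box.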

Answer 6 · 2020-09-06T09:02:50.000Z

I do not provide this part in this code. I wrote more than 10 different files for my model, and this part is contained in another file. Because I am not good at organizing all the code into a polished project, I provided only one file, which contains the most important part of the whole framework. V_r is the feature extracted from MOTIF, which is different from V_O. That is why I do not provide this part of the code: to provide it, I would need to upload a new feature-extractor file and a new dataloader file. I was a naive coder at that time.

Answer 7 · 2020-09-06T09:05:29.000Z

@yangxuntu OK, thanks.

Answer 8 · 2021-09-15T09:22:35.000Z

"In the image encoder, the relation ROI feature, which is also 2048 dimension. What about the RELATION ROI feature?" Could you please provide the pre-trained 2048-dimensional relation ROI feature files? How much does performance change when the relation ROI feature is added? And is there an easy way to extract the V_r ROI feature from MOTIF? That would help me a lot, thank you!

Answer 9 · 2021-09-15T09:23:37.000Z

Could you please provide the pre-trained 2048-dimensional relation ROI feature files?