Shilin-LU/TF-ICON

A question

Closed this issue · 3 comments

Thank you for your great work! But I am a little confused about Formula 5 in the paper. Why add M_user to M_seg? I think M_user is larger than M_seg, so what is the point of this addition operation? Why not just use M_user?

Thank you. It is the XOR operation rather than addition.
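In case it helps, here is a minimal sketch of that distinction, using placeholder names (`user_mask`, `seg_mask`) rather than the repo's actual variables. XOR keeps only the pixels covered by exactly one of the two binary masks, which is different from addition (union):

```python
import torch

# Hypothetical binary masks (1 = inside region, 0 = outside);
# user_mask and seg_mask are placeholder names, not the repo's variables.
user_mask = torch.tensor([[1, 1, 1, 1],
                          [1, 1, 1, 1],
                          [1, 1, 1, 1]], dtype=torch.bool)
seg_mask  = torch.tensor([[0, 1, 1, 0],
                          [0, 1, 1, 0],
                          [0, 0, 0, 0]], dtype=torch.bool)

# XOR: pixels inside the user-specified region but outside the segmentation.
xor_mask = user_mask ^ seg_mask  # same as torch.logical_xor(user_mask, seg_mask)

# Addition/union, shown for contrast: here it just reproduces user_mask.
sum_mask = user_mask | seg_mask

print(xor_mask.int())
print(sum_mask.int())
```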

Thank you. And in the code, why does register_attention_control only replace the cross-attention's forward function rather than the self-attention's? The paper mentions self-attention.

No, in our code both self-attention and cross-attention are composed and injected.
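For readers following along, here is a minimal sketch of that kind of registration, assuming a Stable-Diffusion-style UNet where attention modules are named `attn1` (self-attention) and `attn2` (cross-attention). It is an illustrative monkey-patch of both attention types, not the repo's actual register_attention_control implementation:

```python
import torch.nn as nn

def register_attention_control(unet: nn.Module, controller):
    """Sketch: wrap the forward of every attention module in the UNet,
    covering both self-attention (attn1) and cross-attention (attn2)."""

    def make_wrapped_forward(module, is_cross):
        original_forward = module.forward

        def wrapped_forward(*args, **kwargs):
            out = original_forward(*args, **kwargs)
            # Hand the result to the controller, tagged by attention type.
            # A real implementation would intercept attention maps instead.
            controller(out, is_cross=is_cross)
            return out

        return wrapped_forward

    for name, module in unet.named_modules():
        # Naming convention assumed from Stable-Diffusion-style UNets:
        # 'attn1' = self-attention, 'attn2' = cross-attention.
        if name.endswith("attn1"):
            module.forward = make_wrapped_forward(module, is_cross=False)
        elif name.endswith("attn2"):
            module.forward = make_wrapped_forward(module, is_cross=True)
```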