Has the switching self-attention been applied to all stages?
ziyaxuanyi opened this issue · 2 comments
ziyaxuanyi commented
The paper says the switching self-attention should only be applied to the late up blocks of the UNet to achieve the best results.
However, in this code it looks like the switching self-attention is applied to all stages. A rough sketch of what I mean is below.
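For illustration, here is a minimal sketch (not this repository's actual code) of how a custom self-attention processor could be restricted to the late up blocks of a diffusers-style UNet. The class name `SwitchingSelfAttnProcessor`, the helper `apply_switching_to_late_up_blocks`, and the choice of `up_blocks.2` / `up_blocks.3` as the "late" blocks are all assumptions for the example.

```python
from diffusers import UNet2DConditionModel
from diffusers.models.attention_processor import AttnProcessor


class SwitchingSelfAttnProcessor(AttnProcessor):
    """Hypothetical placeholder: the actual switching self-attention logic
    would override the default behaviour inherited from AttnProcessor."""
    pass


def apply_switching_to_late_up_blocks(unet, late_up_blocks=("up_blocks.2", "up_blocks.3")):
    """Install the switching processor only on self-attention (attn1) layers
    inside the chosen late up blocks; keep the default processor elsewhere."""
    processors = {}
    for name, default_proc in unet.attn_processors.items():
        is_self_attn = name.endswith("attn1.processor")            # attn1 = self-attention
        in_late_up = any(name.startswith(b) for b in late_up_blocks)
        if is_self_attn and in_late_up:
            processors[name] = SwitchingSelfAttnProcessor()
        else:
            processors[name] = default_proc
    unet.set_attn_processor(processors)


# Randomly initialized UNet, purely to demonstrate the filtering.
unet = UNet2DConditionModel()
apply_switching_to_late_up_blocks(unet)
```

The idea is simply to filter `unet.attn_processors` by module name before calling `set_attn_processor`, rather than replacing every processor.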
ExponentialML commented
This seems to be the case! I completely overlooked it.
ExponentialML commented
I'm going to close this issue, since the functionality has now been implemented.
Please feel free to ping again if there are any concerns that relate to this issue.