Junqi-Zhang/AETN

question about the L2 code

Opened this issue · 0 comments

image

Sorry . I have one question about the codes.

Why the code to compute the bottleneck_L2 and the code to compute the transformer_L2 is the same when the parameter only_part is False?

Looking forward your reply!