Opened this issue 3 years ago · 0 comments
Sorry . I have one question about the codes.
Why the code to compute the bottleneck_L2 and the code to compute the transformer_L2 is the same when the parameter only_part is False?
Looking forward your reply!