luissen/ESRT

What does common.Scale(1) mean?


import torch
import torch.nn as nn

class Scale(nn.Module):
    """Multiplies the input by a single learnable scalar."""
    def __init__(self, init_value=1e-3):
        super().__init__()
        # A one-element learnable parameter, initialized to init_value.
        self.scale = nn.Parameter(torch.FloatTensor([init_value]))

    def forward(self, input):
        return input * self.scale

When self.scale = 1, does this layer do nothing?
Why do we need this layer?

Is self.scale the learnable parameter λₓ in the paper?
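A quick check (repeating the class above so the snippet runs standalone; the interpretation of λₓ is my assumption, not confirmed by the authors): with init_value=1 the layer is the identity at initialization, but self.scale is still an nn.Parameter, so it receives gradients and can move away from 1 during training.

```python
import torch
import torch.nn as nn

class Scale(nn.Module):
    def __init__(self, init_value=1e-3):
        super().__init__()
        self.scale = nn.Parameter(torch.FloatTensor([init_value]))

    def forward(self, input):
        return input * self.scale

layer = Scale(init_value=1.0)
x = torch.ones(3)
out = layer(x)

# At initialization with init_value=1, the layer is an identity map.
print(torch.allclose(out, x))  # True

# But the scalar is learnable: it gets a gradient, so an optimizer
# can adjust it away from 1 as training proceeds.
out.sum().backward()
print(layer.scale.grad is not None)  # True
```

So the layer is only a no-op at the moment of initialization; its purpose is to let the network learn how much to weight that branch's output.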