[Feature] nn.LazyLinear
Opened this issue · 2 comments
holdjun commented
What is the feature?
The use of nn.LazyLinear
will result in an error in the _dump_init_info
function of the BaseModule
. The main reason is that _dump_init_info
writes the model's weights to the saved info, and if nn.LazyLinear
is used, the weights are not initialized before the forward pass, leading to an error.
It is hoped that support for the use of nn.LazyLinear
can be provided.
Any other context?
https://pytorch.org/docs/stable/generated/torch.nn.modules.lazy.LazyModuleMixin.html#torch.nn.modules.lazy.LazyModuleMixin
https://pytorch.org/docs/stable/generated/torch.nn.LazyLinear.html#torch.nn.LazyLinear