krasserm/perceiver-io

Share weights of embedding layer with output layer in `CausalLanguageModel`

krasserm opened this issue · 0 comments

The embedding layer and the output layer in `CausalLanguageModel` should share their weights.
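For illustration, weight sharing (often called weight tying) between an embedding layer and an output projection can be sketched in PyTorch as below. This is a minimal toy model, not the actual `CausalLanguageModel` code: the class `TinyTiedLM` and its attribute names `embedding` and `output` are hypothetical, and the sketch assumes the output layer is a bias-free linear projection whose weight has the same `(vocab_size, num_channels)` shape as the embedding matrix.

```python
import torch
import torch.nn as nn


class TinyTiedLM(nn.Module):
    """Hypothetical toy model; only demonstrates the weight-tying step."""

    def __init__(self, vocab_size: int, num_channels: int):
        super().__init__()
        # Embedding weight has shape (vocab_size, num_channels).
        self.embedding = nn.Embedding(vocab_size, num_channels)
        # Linear weight has shape (out_features, in_features),
        # i.e. (vocab_size, num_channels), so the two can be shared.
        self.output = nn.Linear(num_channels, vocab_size, bias=False)
        # Tie the weights: both layers now reference the same Parameter.
        self.output.weight = self.embedding.weight

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        hidden = self.embedding(token_ids)  # (batch, seq_len, num_channels)
        return self.output(hidden)          # (batch, seq_len, vocab_size)


model = TinyTiedLM(vocab_size=100, num_channels=16)
logits = model(torch.randint(0, 100, (2, 5)))
# parameters() yields a shared Parameter only once.
num_params = sum(p.numel() for p in model.parameters())
```

Because both layers hold the same `Parameter` object, gradients from the input and output side accumulate into one tensor, and the parameter count covers the shared matrix only once.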