/transformer_layer_swap

An experiment demonstrating the layers of a transformers model can be swapped without breaking the model

Primary LanguageJupyter NotebookMIT LicenseMIT

Stargazers