An experiment demonstrating the layers of a transformers model can be swapped without breaking the model
Open the layer_swap.ipynb notebook to run an example code that swaps the layers of phi1.5 without loss in accuracy
An experiment demonstrating the layers of a transformers model can be swapped without breaking the model
Jupyter NotebookMIT