jwang0306/transformer-pytorch
A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).
Jupyter Notebook
A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).
Jupyter Notebook