/transformer-pytorch

A PyTorch implementation of Transformer, experimenting with both Post-LN (Post-LayerNorm) and Pre-LN (Pre-LayerNorm).

Primary LanguageJupyter Notebook

Watchers