/transformer-walkthrough

A walkthrough of transformer architecture code

Primary LanguageJupyter NotebookMIT LicenseMIT

A walkthrough of transformer architecture code

Open In Colab

The notebook walks through a single forward pass of the Transformer architecture in pytorch. It is meant for illustration and educational purposes only. The walkthrough explains every stage of the architecture accompanied by a detailed computation graph.

Transformer Computation Graph