/First-Principles-Transformers

An end-to-end walkthrough of the transformer architecture and its heuristics, for my own (and potentially others') learning purposes. Updated continuously.

Primary language: Jupyter Notebook
License: MIT
