Some small projects to learn the low level details of attention mechanisms and transformers.
Primary LanguageJupyter Notebook