My custom decoder-only, autoregressive, GPT-style model.
Primary language: Jupyter Notebook. License: MIT.
My implementation of an autoregressive, decoder-only transformer for math.
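For readers unfamiliar with the terms, here is a minimal sketch of what "decoder-only" and "autoregressive" usually mean in practice. It is not taken from this repository's notebooks: `causal_mask`, `masked_softmax`, `generate`, and the `next_token` callback are hypothetical names chosen for illustration, and the "model" is a stand-in function rather than a trained transformer.

```python
import math

def causal_mask(n):
    """Lower-triangular mask: position i may attend to positions 0..i only."""
    return [[j <= i for j in range(n)] for i in range(n)]

def masked_softmax(scores, mask_row):
    """Softmax over one row of attention scores, ignoring masked-out positions."""
    kept = [s for s, m in zip(scores, mask_row) if m]
    peak = max(kept)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) if m else 0.0 for s, m in zip(scores, mask_row)]
    total = sum(exps)
    return [e / total for e in exps]

def generate(next_token, prompt, max_new_tokens, eos=None):
    """Greedy autoregressive loop: each new token is fed back in as input."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tok = next_token(tokens)  # hypothetical stand-in for a model forward pass
        tokens.append(tok)
        if tok == eos:
            break
    return tokens
```

The mask is what keeps the model from looking at future tokens during training, and the loop is how it produces output one token at a time at inference. For example, `generate(lambda t: (t[-1] + 1) % 10, [1, 2], 3)` continues the sequence using a toy rule in place of a real model.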