Generative Pretrained Model (GPT) in JAX. A step by step guide to train LLMs on large datasets from scratch
Primary LanguageJupyter NotebookMIT LicenseMIT