Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
Primary LanguageJupyter Notebook