/Steel-LLM

Train a 1B-parameter LLM on 1T tokens from scratch as an individual developer

Primary Language: Python
