/minimind

🚀🚀 [Large Model] Train a small 26M-parameter GPT entirely from scratch in just 3 hours! 🌏

Primary Language: Python
License: Apache License 2.0 (Apache-2.0)
