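[LLM] Train a small GPT with only 26M parameters completely from scratch in 3 hours. A GPU with as little as 2 GB of memory is enough for inference and training!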
Primary language: Python. License: Apache License 2.0 (Apache-2.0).