JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dataset.
Primary LanguagePythonMIT LicenseMIT
No one’s watching this repository yet.