/jax-llm

JAX implementation of Large Language Models. You can train GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dataset.

Primary LanguagePythonMIT LicenseMIT

Watchers

No one’s watching this repository yet.