/gpt-neox

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

Primary LanguagePythonMIT LicenseMIT

Stargazers

No one’s star this repository yet.