0ppxnhximxr/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
PythonApache-2.0
Stargazers
No one’s star this repository yet.
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
PythonApache-2.0
No one’s star this repository yet.