robertalanm/gpt-neox
An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Python, Apache-2.0
No issues in this repository yet.