frankxu2004/gpt-neox
An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Python · Apache-2.0