/pytorch-gpt-x

Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.

Primary LanguagePython

Watchers