/DeeperSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Primary LanguagePythonMIT LicenseMIT

License MIT

DeeperSpeed

DeeperSpeed is a fork of Microsoft's Deepspeed library that is tailor-made for the GPT-NeoX by EleutherAI.

Prior to 3/9/2023, DeeperSpeed was based on an old version of DeepSpeed (0.3.15). In order to migrate to the latest upstream DeepSpeed version while allowing users to access the old versions of GPT-NeoX and DeeperSpeed, we have introduced two versioned releases for both libraries: