/simple-mpt30b-ddp

Simple Example to train MPT 30B (Single GPU and DDP) model using LORA and Int8 training

Primary LanguagePython

Watchers