/nlp_ddp

PyTorch DistributedDataParallel training for Transformer models.

Primary LanguagePython

Watchers