My attempt at training GPT-1 and learning about multi-gpu training
Lots of stuff taken from https://github.com/pytorch/examples/tree/master/word_language_model
My attempt at training GPT-1 and learning about multi-gpu training
Lots of stuff taken from https://github.com/pytorch/examples/tree/master/word_language_model