/Scratch2LM

Train transformer models (e.g. RoBERTa, GPT-2, and GPT-J) from scratch.

Primary language: Python
License: GNU General Public License v3.0 (GPL-3.0)
