/StackingBERT

Source code for "Efficient Training of BERT by Progressively Stacking"

Primary language: Python. License: Other (NOASSERTION)