/ScTD

Revisiting Token Dropping Strategy in Efficient BERT Pretraining

Primary LanguagePython

No issues in this repository yet.