/Transformer_Distillation

Knowledge Distillation For Transformer Language Models

Primary LanguagePython

Watchers