复现论文《Distilling Task-Specific Knowledge from BERT into Simple Neural Networks》
Primary LanguagePythonMIT LicenseMIT