MCG-NKU/E2FGVI

Request a suggestion for model distillation

980202006 opened this issue · 2 comments

This model is great, but inference is a bit slow. I would like to try distilling it. Could you give some advice, such as which layers can be reduced or removed?

You could reduce the number of transformer blocks.
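As a rough illustration of that suggestion, one common way to set up a shallower student is to keep an evenly spaced subset of the teacher's blocks and initialize the student from them before distillation. The sketch below only computes which block indices to keep. The helper name `select_teacher_blocks` and the block counts (8 teacher blocks, 4 student blocks) are assumptions for illustration, not values or code taken from the E2FGVI repository.

```python
def select_teacher_blocks(num_teacher: int, num_student: int) -> list[int]:
    """Return evenly spaced teacher block indices for a shallower student.

    Hypothetical helper (not part of E2FGVI). The last teacher block is
    always kept; the first is kept too whenever num_student >= 2.
    """
    if not 1 <= num_student <= num_teacher:
        raise ValueError("num_student must be between 1 and num_teacher")
    if num_student == 1:
        # With a single block, keep the one closest to the output.
        return [num_teacher - 1]
    # Spread the student blocks uniformly over the teacher's depth.
    step = (num_teacher - 1) / (num_student - 1)
    # int(x + 0.5) rounds half up, avoiding Python's round-half-to-even.
    return [int(i * step + 0.5) for i in range(num_student)]


# Assumed example: an 8-block teacher distilled into a 4-block student.
print(select_teacher_blocks(8, 4))  # [0, 2, 5, 7]
```

The student could then copy the weights of the selected blocks and be trained to match the teacher's outputs. How much quality is lost at a given depth would have to be measured experimentally.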

Thank you!