980202006 opened this issue 2 years ago · 2 comments
This model is great, but it runs a bit slowly, so I want to try distilling it. Can you give some advice, such as which layers can be reduced or removed?
You could reduce the number of transformer blocks.
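One possible way to set that up, as a rough sketch rather than anything specific to this repo: build a student with fewer blocks and initialize it from evenly spaced teacher blocks before running distillation. The helper name `pick_teacher_blocks` and the even-spacing rule are assumptions for illustration; the depths 12 and 6 are made-up numbers, not this model's actual configuration.

```python
def pick_teacher_blocks(num_teacher_blocks: int, num_student_blocks: int) -> list[int]:
    """Return evenly spaced teacher block indices to copy into a shallower student.

    Hypothetical helper: the first and last teacher blocks are always kept
    (when the student has at least two blocks) and the rest are spread between.
    """
    if not 1 <= num_student_blocks <= num_teacher_blocks:
        raise ValueError("student depth must be between 1 and the teacher depth")
    if num_student_blocks == 1:
        # With a single student block, keep the block closest to the output.
        return [num_teacher_blocks - 1]
    last = num_teacher_blocks - 1
    # Integer arithmetic keeps the result deterministic (no float rounding).
    return [(i * last) // (num_student_blocks - 1) for i in range(num_student_blocks)]


if __name__ == "__main__":
    # Assumed example: a 12-block teacher distilled into a 6-block student.
    print(pick_teacher_blocks(12, 6))  # [0, 2, 4, 6, 8, 11]
```

The returned indices say which teacher blocks to copy weights from; the student would then be trained to match the teacher's outputs. Whether even spacing is the best choice for this model is untested here.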
Thank you!