Have you tried the Transformers MSDA implementation?
NineMeowICT opened this issue · 1 comment
NineMeowICT commented
I think the Transformers library could provide a more general implementation with better cross-platform compatibility.
I'm interested in this. If you have tried it, did it degrade inference performance?
NineMeowICT commented
Sorry, I misunderstood your model's architecture.