MoEBERT

This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).

Primary language: Python
License: Apache License 2.0 (Apache-2.0)
