/Parameter-Efficient-MoE

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Primary language: Python
License: Apache License 2.0 (Apache-2.0)
