/LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Primary LanguagePythonMIT LicenseMIT

Stargazers