/GBLM-Pruner

Are gradient information useful for pruning of LLMs?

Primary LanguagePythonMIT LicenseMIT