ShortGPT

Unofficial implementations of block/layer-wise pruning methods for LLMs.

Primary language: Jupyter Notebook
License: MIT
