Aaronhuang-778/SliM-LLM
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
Python
Issues
- 0
Calculating saliency of weight
#7 opened by kiucho - 3
- 4
Clarification on Theorem 1 of the Paper
#6 opened by kiucho - 2
Cannot reproduce
#5 opened by haoming-codes - 2
the quantized bit width
#3 opened by xiaxin1998 - 6
- 1