/JSQ

[ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization

MIT LicenseMIT

JSQ: Compressing Large Language Models by Joint Sparsification and Quantization

Official code for ICML 2024 paper[Compressing Large Language Models by Joint Sparsification and Quantization]