
JSQ: Compressing Large Language Models by Joint Sparsification and Quantization

[paper]

Intuition

JSQ is a joint compression method for large language models that combines sparsification and quantization, achieving minimal performance loss even at high compression ratios.
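To make the two operations concrete, the sketch below shows magnitude pruning followed by per-output-channel uniform quantization of a single weight matrix. This is only an illustration of what is being combined, not the JSQ algorithm itself (the actual activation-aware criterion lives in main.py and the paper); the function name, defaults, and the 4096x4096 example are assumptions.

import torch

def sparsify_and_quantize(weight: torch.Tensor, sparsity: float = 0.5, n_bits: int = 4) -> torch.Tensor:
    # 1) Sparsify: zero out the smallest-magnitude weights.
    k = max(int(weight.numel() * sparsity), 1)
    threshold = weight.abs().flatten().kthvalue(k).values
    mask = weight.abs() > threshold

    # 2) Quantize: per-output-channel symmetric uniform quantization.
    qmax = 2 ** (n_bits - 1) - 1
    scale = weight.abs().amax(dim=1, keepdim=True).clamp(min=1e-8) / qmax
    quantized = torch.clamp(torch.round(weight / scale), -qmax, qmax) * scale

    # Apply the sparsity mask to the quantized weights.
    return quantized * mask

# Example: compress a random 4096x4096 projection to 50% sparsity at 4 bits.
w = torch.randn(4096, 4096)
w_compressed = sparsify_and_quantize(w, sparsity=0.5, n_bits=4)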

Installation

conda create -n jsq python=3.10 -y
conda activate jsq
git clone https://github.com/uanu2002/JSQ.git
cd JSQ
pip install --upgrade pip 
pip install transformers accelerate datasets

Usage

Compression

python main.py

Evaluation

We use EleutherAI/lm-evaluation-harness (https://github.com/EleutherAI/lm-evaluation-harness), a framework for few-shot evaluation of language models, as the evaluation tool.
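As a hedged example, once the harness is installed, a compressed checkpoint saved in Hugging Face format can be scored through its Python API. The checkpoint path ./jsq-compressed, the task list, and the batch size below are assumptions, not part of this repository.

import lm_eval

# Evaluate a compressed checkpoint (Hugging Face format) on a few downstream tasks.
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=./jsq-compressed",  # assumed output path of the compressed model
    tasks=["piqa", "hellaswag"],
    batch_size=8,
)
print(results["results"])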

Citation

If you find JSQ useful or relevant to your research, please cite our paper:

@InProceedings{pmlr-v235-guo24g,
      title = {Compressing Large Language Models by Joint Sparsification and Quantization},
      author = {Guo, Jinyang and Wu, Jianyu and Wang, Zining and Liu, Jiaheng and Yang, Ge and Ding, Yifu and Gong, Ruihao and Qin, Haotong and Liu, Xianglong},
      booktitle = {Proceedings of the 41st International Conference on Machine Learning},
      pages = {16945--16957},
      year = {2024}
}