[ICML 2024] JSQ: Compressing Large Language Models by Joint Sparsification and Quantization
MIT LicenseMIT
No one’s watching this repository yet.