/AQLM

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers