/QuaRot

[Fork] Code for QuaRot: end-to-end 4-bit inference of large language models.

Primary Language: Python
License: Apache License 2.0 (Apache-2.0)
