pashminacameron/QuaRot
[Fork] Code for QuaRot, an end-to-end 4-bit inference of large language models.
PythonApache-2.0
Watchers
No one’s watching this repository yet.
[Fork] Code for QuaRot, an end-to-end 4-bit inference of large language models.
PythonApache-2.0
No one’s watching this repository yet.