/dense_sparse_quant_hessian

Dense and sparse quantization of open source large lange models, (LLama2, Vicuna), based on Hessian space information. Keeping high accurance and breaking the Memeory Wall.

Apache License 2.0Apache-2.0

Watchers