elphinkuo/dense_sparse_quant_hessian
Dense-and-sparse quantization of open-source large language models (Llama 2, Vicuna), based on Hessian space information. Keeps accuracy high while breaking the memory wall.
Apache-2.0