Pinned Repositories
BiLLM
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference.
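As a rough illustration of what 4-bit weight quantization means, here is a minimal round-to-nearest sketch with per-group scales and zero-points. This is not AutoAWQ's actual implementation (AWQ additionally protects salient weight channels using activation statistics); the function names and group size are illustrative assumptions.

```python
import numpy as np

def quantize_int4(w, group_size=128):
    """Round-to-nearest 4-bit quantization with per-group scale/zero-point.

    Illustrative sketch only -- not AutoAWQ's algorithm. Weights are split
    into groups of `group_size` values; each group gets its own scale and
    zero-point so that its range maps onto the 16 levels 0..15.
    """
    w = np.asarray(w, dtype=np.float32).reshape(-1, group_size)
    w_min = w.min(axis=1, keepdims=True)
    w_max = w.max(axis=1, keepdims=True)
    scale = (w_max - w_min) / 15.0            # 4 bits -> 16 levels
    zero = np.round(-w_min / scale)           # maps w_min onto level 0
    q = np.clip(np.round(w / scale + zero), 0, 15)
    return q.astype(np.uint8), scale, zero

def dequantize_int4(q, scale, zero):
    """Recover approximate float weights from 4-bit codes."""
    return (q.astype(np.float32) - zero) * scale
```

The maximum reconstruction error per weight is half a quantization step (scale / 2), which is why per-group (rather than per-tensor) scales matter: smaller groups track local weight ranges and shrink the step size.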
quip-sharp
flute
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
EfficientQAT
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Llama3.1-Finetuning
Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning for Llama 3.
AQLM
Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization" (https://arxiv.org/pdf/2401.06118.pdf) and "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression" (https://arxiv.org/abs/2405.14852).
SpQR
LiMa-cas's Repositories
LiMa-cas doesn't have any repositories yet.