Pinned Repositories
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
kolbeuk's Repositories
kolbeuk doesn’t have any repositories yet.