anujnayyar1's Stars
bentoml/BentoDiffusion
BentoDiffusion: A collection of diffusion models served with BentoML
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".