InftyAI/llmlite

Support quantization

kerthcet opened this issue · 0 comments

  • int8
  • int4
  • gptq
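
For reference, a minimal sketch of what the int8 and int4 items could mean at the weight level, assuming plain symmetric (absmax) per-tensor quantization. This is not llmlite's API: `quantize_absmax` and `dequantize` are hypothetical names used only for illustration. GPTQ is not covered, since it needs calibration data and a layer-wise error-minimizing solve.

```python
from typing import List, Tuple


def quantize_absmax(weights: List[float], bits: int = 8) -> Tuple[List[int], float]:
    """Hypothetical helper: symmetric (absmax) quantization to signed `bits`-bit ints."""
    qmax = 2 ** (bits - 1) - 1  # 127 for int8, 7 for int4
    absmax = max((abs(w) for w in weights), default=0.0)
    # An all-zero (or empty) tensor gets scale 1.0 to avoid dividing by zero.
    scale = absmax / qmax if absmax > 0 else 1.0
    # Round to the nearest integer and clamp into [-qmax, qmax].
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q: List[int], scale: float) -> List[float]:
    """Hypothetical helper: map quantized integers back to approximate floats."""
    return [v * scale for v in q]


if __name__ == "__main__":
    weights = [0.5, -1.0, 0.25, 0.0]
    for bits in (8, 4):
        q, scale = quantize_absmax(weights, bits=bits)
        print(bits, q, dequantize(q, scale))
```

Real int4 support would also need to pack two 4-bit values per byte and would usually use per-group rather than per-tensor scales; both are left out here to keep the sketch short.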