int5
There is 1 repository under the int5 topic.
intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization