int6
There are 1 repositories under int6 topic.
intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
There are 1 repositories under int6 topic.
An innovative library for efficient LLM inference via low-bit quantization