anthony-intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
C++ · Apache-2.0
Stargazers
No one’s starred this repository yet.