intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
C++Apache-2.0
Stargazers
- iakashpaul
- amadeuzou
- ld-william
- YuanQinHongXin
- nnn112358
- youngho-bae
- wutthichai46
- Zhenzhong1
- shamio
- matthewdouglasIndianapolis, IN
- sroussey
- jgjlBay Area
- Richardyu114Shanghai
- yingfengChina
- lmsreborn
- camilovista2010Spain
- SweetpopcornSimonAustria
- PastramiKing
- ZanyRain
- ArtisticCoding
- AdityaKulshresthaIndia
- smilexinShanghai.PuDong
- denzukoNew York, New York
- zhijl
- lkk12014402beijing
- Pang-GJHangzhou, Zhejiang, China
- bratao
- opticbluTexas
- Ben-PerlinManchester, NH
- niranjanaryanGreater Bengaluru, India
- my-vegetable-has-explodedBeijing
- zaxtax
- negrinhoNew York
- BlueKiji77
- yutianchen666
- bkj