intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization
C++, Apache-2.0
Stargazers
- airMeng (Intel)
- bqpro1 (Warsaw)
- chuanmingliu (Westeros)
- dawmster
- eruma
- eugenesiow (Singapore)
- Fire-
- ftian1 (Intel)
- gurusura (Sura Systems Private Limited)
- hshen14
- JatinTiwaricodes
- jpmanson (Interactive Dynamics)
- kevinintel
- kontroluzmani
- learning-chip
- lukaszpluzynski
- luoyu-intel (Intel)
- lvliang-intel
- Minhui-Xie (Tsinghua University)
- MrBenzWorld
- Perferic (Paris)
- redtwiggy
- Rinoahu (ISU)
- ShivamKumar2002 (Rein Games Private Limited)
- shuxiaokai (M78 Nebula)
- smellslikeml (SmellsLikeML)
- spiderman001 (ShenZhen China)
- stamate (@Birkbeck-Computer-Science-Research)
- tripathiarpan20
- WhiteDevil3012
- XxroxiplxX (Wrocław University of Science and Technology)
- yiliu30 (AI Frameworks Engineer @Intel)
- zbhhbzzbhnot-a-penny.Limited
- Zengyf-CVer
- zhentaoyuintel
- zhewang1-intc (Intel)