neuralmagic/nm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python · NOASSERTION
Stargazers
- a-tabaza (Princess Sumaya University for Technology)
- addvin
- antferdom (@datacrunch-research)
- AswanthManoj (Co-founder | azma.ai)
- BecomeAllan (Brazil)
- camenduru (🥪 tost.ai)
- cnmoro (Wise Systems)
- devfacet (MA, USA)
- dsikka (@neuralmagic, Columbia University, University of Waterloo)
- ekorman (Striveworks)
- Fire-
- fpcsong (OPPO)
- Franckevicius
- harrybolingot
- hollstein (Berlin)
- huyangqiu
- jeanniefinks (Neural Magic)
- kingkong135
- liubonan123
- LucasWilkinson
- marcella-found
- mgoin (@neuralmagic)
- mklasby (University of Calgary)
- mwitiderrick
- nidhoggr-nil
- photosban (Intel Corporation)
- ProExpertProg (Neural Magic)
- rgreenberg1 (Neural Magic)
- robertgshaw2-neuralmagic (@neuralmagic)
- shiqingzhangCSU (Central South University)
- shyamsn97
- SuperSecureHuman (Mars)
- sutyum (@TechnocultureResearch)
- VfBfoerst
- vilsonrodrigues
- vvonchain