Neural Magic

Neural Magic

Neural Magic empowers developers to optimize and deploy LLMs at scale. Our model compression and acceleration enable top performance with vLLM.

Location:Boston

Pinned Repositories

Neural Magic's Repositories