/dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

Primary LanguageC++Apache License 2.0Apache-2.0

Watchers

No one’s watching this repository yet.