Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Primary language: C++
License: MIT