/inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Primary LanguageC++MIT LicenseMIT

This repository is not active