lightweight, standalone C++ inference engine for Google's Gemma models.
Primary LanguageC++Apache License 2.0Apache-2.0