/fleece-benchmark

A benchmark framework for LLM serving performance, based on API call

Primary LanguagePython

Watchers