/worker-tgi

The RunPod worker template for serving our large language model endpoints. Powered by Text Generation Inference.

Primary LanguagePythonMIT LicenseMIT

Watchers