/batch-inference

Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.

Primary LanguagePythonMIT LicenseMIT

Watchers