A high-performance serving framework for ML models, offers dynamic batching and multi-stage pipeline to fully exploit your compute machine
Primary LanguagePythonApache License 2.0Apache-2.0