astramind-ai/BitMat

Create a Triton backend

Closed this issue · 3 comments

Would it be possible to create a Triton backend from this implementation?

A Triton backend is the implementation that executes a model. A backend can be a wrapper around a deep-learning framework, like PyTorch, TensorFlow, TensorRT or ONNX Runtime. Or a backend can be custom C/C++ logic performing any operation (for example, image pre-processing).
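For reference, one of the simplest ways to write such a backend is the Python backend, where a backend is just a `model.py` implementing a `TritonPythonModel` class. A minimal sketch, assuming the standard `triton_python_backend_utils` module that ships with the Python backend; the tensor names `INPUT0`/`OUTPUT0` are placeholders that would have to match the model's `config.pbtxt`:

```python
# model.py -- minimal sketch of a Triton Inference Server Python backend.
# Tensor names "INPUT0"/"OUTPUT0" are placeholders; they must match config.pbtxt.
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    def initialize(self, args):
        # Load weights / build the model here (e.g. a BitMat-based model).
        pass

    def execute(self, requests):
        responses = []
        for request in requests:
            # Read the input tensor declared in config.pbtxt.
            input_tensor = pb_utils.get_input_tensor_by_name(request, "INPUT0")
            data = input_tensor.as_numpy()

            # Run inference here; identity pass-through used as a placeholder.
            output_tensor = pb_utils.Tensor("OUTPUT0", data)
            responses.append(
                pb_utils.InferenceResponse(output_tensors=[output_tensor])
            )
        return responses

    def finalize(self):
        # Optional cleanup when the model is unloaded.
        pass
```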

I don't have direct experience with Triton Inference Server; I'll look into it in the next few days.

@EwoutH I think you confused OpenAI Triton (the language) with NVIDIA Triton (an API server written in C++).
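For contrast, OpenAI Triton is a Python-embedded language for writing GPU kernels, which is what this repo uses. A minimal sketch of a Triton language kernel, assuming only the `triton` package, just to show how different it is from an inference server:

```python
# Minimal OpenAI Triton (the language) kernel: elementwise vector add.
import triton
import triton.language as tl


@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n, BLOCK: tl.constexpr):
    # Each program instance handles one BLOCK-sized chunk of the vectors.
    pid = tl.program_id(0)
    offs = pid * BLOCK + tl.arange(0, BLOCK)
    mask = offs < n
    x = tl.load(x_ptr + offs, mask=mask)
    y = tl.load(y_ptr + offs, mask=mask)
    tl.store(out_ptr + offs, x + y, mask=mask)
```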

Right, I didn't figure that out from the README. Thanks for clearing that up!