Pinned Repositories
client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
FasterTransformer
Transformer related optimization, including BERT, GPT
NeMo
NeMo: a toolkit for conversational AI
python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
pziecina-nv's Repositories
pziecina-nv/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
pziecina-nv/FasterTransformer
Transformer related optimization, including BERT, GPT
pziecina-nv/NeMo
NeMo: a toolkit for conversational AI
pziecina-nv/python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
pziecina-nv/pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.