Pinned Repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
aravis
A vision library for GenICam-based cameras
core
The core library and APIs implementing the Triton Inference Server.
dify-image-mirror
models
A collection of pre-trained, state-of-the-art models in the ONNX format
nccl_cuda118
Various NCCL versions compiled against CUDA 11.8
python_backend
Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.
sample
A tritonserver example
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vllm_whl_repo
zhaotyer's Repositories
zhaotyer/vllm_whl_repo
zhaotyer/sample
A tritonserver example
zhaotyer/aravis
A vision library for GenICam-based cameras
zhaotyer/core
The core library and APIs implementing the Triton Inference Server.
zhaotyer/dify-image-mirror
zhaotyer/models
A collection of pre-trained, state-of-the-art models in the ONNX format
zhaotyer/nccl_cuda118
Various NCCL versions compiled against CUDA 11.8
zhaotyer/python_backend
Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python.
zhaotyer/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs