Pinned Repositories
developer_tools
javacpp-presets
The missing Java distribution of native C++ libraries
python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
pytorch_backend
The Triton backend for the PyTorch TorchScript models.
VRC-4478F-2021
i dont know what im doing
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
developer_tools
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
baojunliu's Repositories
baojunliu/VRC-4478F-2021
i dont know what im doing
baojunliu/developer_tools
baojunliu/javacpp-presets
The missing Java distribution of native C++ libraries
baojunliu/python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
baojunliu/pytorch_backend
The Triton backend for the PyTorch TorchScript models.