Pinned Repositories
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
DejaVu
torch-int
This repository contains integer operators on GPUs for PyTorch.
sparse_gpu_operator
GPU operators for sparse tensor operations
123312
ControlNet_TensorRT
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
torch-int
This repository contains integer operators on GPUs for PyTorch.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
sleepcoo's Repositories
sleepcoo/123312
sleepcoo/ControlNet_TensorRT
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案
sleepcoo/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
sleepcoo/torch-int
This repository contains integer operators on GPUs for PyTorch.
sleepcoo/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs