Pinned Repositories
flashinfer
FlashInfer: Kernel Library for LLM Serving
sglang
SGLang is a fast serving framework for large language models and vision language models.
clangd-static-binary-centos7
flux
A fast communication-overlapping library for tensor parallelism on GPUs.
hnbot
Hacker News Bot
ht
A secure SOCKS5 proxy, designed to protect your Internet traffic.
lmdeploy-build
Nightly Build for LMDeploy
lmdeploy-fork-sync
medusa-whl-centos7
python_backend
r23.02 fix
zhyncs's Repositories
zhyncs/ht
A secure SOCKS5 proxy, designed to protect your Internet traffic.
zhyncs/lmdeploy-build
Nightly Build for LMDeploy
zhyncs/flux
A fast communication-overlapping library for tensor parallelism on GPUs.
zhyncs/clangd-static-binary-centos7
zhyncs/hnbot
Hacker News Bot
zhyncs/lmdeploy-fork-sync
zhyncs/medusa-whl-centos7
zhyncs/python_backend
r23.02 fix
zhyncs/sglang-fork-sync
zhyncs/zhyncs
About me
zhyncs/zhyncs.github.io
https://zhyncs.com
zhyncs/TensorRT-LLM-Hacks
TensorRT LLM Hacks
zhyncs/Workshop-TRT-LLM