Pinned Repositories
Capsule
A two-phase trigger where a watcher watches the changes and a doer reacts to the changes
cutlass
CUDA Templates for Linear Algebra Subroutines
Demos
Cheatsheet for fast coding
EnvDeployment
Environment deployment for Linux Ubuntu including VPN, website
nccl
Optimized primitives for collective multi-GPU communication
nnscaler
PaGraph
SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.
RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
Tessel
HPCA'2024 & TPDS'2024 Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search
undergraduate-lab
This repo shows some of my undergraduate courses' labs @USTC
zhiqi-0's Repositories
zhiqi-0/PaGraph
SoCC'20 and TPDS'21: Scaling GNN Training on Large Graphs via Computation-aware Caching and Partitioning.
zhiqi-0/RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
zhiqi-0/Tessel
HPCA'2024 & TPDS'2024 Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search
zhiqi-0/undergraduate-lab
This repo shows some of my undergraduate courses' labs @USTC
zhiqi-0/EnvDeployment
Environment deployment for Linux Ubuntu including VPN, website
zhiqi-0/nnscaler
zhiqi-0/Capsule
A two-phase trigger where a watcher watches the changes and a doer reacts to the changes
zhiqi-0/cutlass
CUDA Templates for Linear Algebra Subroutines
zhiqi-0/Demos
Cheatsheet for fast coding
zhiqi-0/nccl
Optimized primitives for collective multi-GPU communication
zhiqi-0/Toys
A toy tool for C++ library
zhiqi-0/PyPDFCrop
pdfcrop commandline tools for windows/linux/macos