Pinned Repositories
msccl-executor-nccl
nccl
Optimized primitives for collective multi-GPU communication
Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
sem-simlarity
sem-simlarity between two sentences
sok_test
torchrec
Pytorch domain library for recommendation systems
zero-shot-gcn
Zero-Shot Learning with GCN (CVPR 2018)
HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
torchrec
Pytorch domain library for recommendation systems
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
kanghui0204's Repositories
kanghui0204/nccl
Optimized primitives for collective multi-GPU communication
kanghui0204/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
kanghui0204/sem-simlarity
sem-simlarity between two sentences
kanghui0204/sok_test
kanghui0204/torchrec
Pytorch domain library for recommendation systems
kanghui0204/zero-shot-gcn
Zero-Shot Learning with GCN (CVPR 2018)