kingchc's Stars
pytorch/torchrec
Pytorch domain library for recommendation systems
facebookresearch/dietgpu
GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.
openucx/ucc
Unified Collective Communication Library
facebookresearch/fairring
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large scales
pytorch/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
openucx/xccl
facebookresearch/param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.
facebookresearch/dlrm
An implementation of a deep learning recommendation model (DLRM)
portante/pycscope
Cscope database generator for Python source code
NVIDIA-developer-blog/code-samples
Source code examples from the Parallel Forall Blog
Triple-L/CV_Resume_latex_template
CV/Resume_latex_template
pmodels/mpich
Official MPICH Repository
intel/mpi-benchmarks
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
LLNL/Comb
Comb is a communication performance benchmarking tool.
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
amix/vimrc
The ultimate Vim configuration (vimrc)
amacgregor/dot-files
Dotfiles repository
detailyang/awesome-cheatsheet
:beers: awesome cheatsheet