Pinned Repositories
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
cythonize-setuptools
Cythonize Python code and build a wheel distribution package
DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
HadoopOWLQN
ma
incubator-tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
interactive-decision-tree
A data visualization project that helps preprocess and analyze small data sets.
nccl
Optimized primitives for collective multi-GPU communication
nccl-tests
NCCL Tests without MPI
ray
A system for parallel and distributed Python that unifies the ML ecosystem.
tensorflow
An Open Source Machine Learning Framework for Everyone
xutianming's Repositories
xutianming/nccl-tests
NCCL Tests without MPI
xutianming/HadoopOWLQN
ma
xutianming/interactive-decision-tree
A data visualization project that helps preprocess and analyze small data sets.
xutianming/cythonize-setuptools
Cythonize Python code and build a wheel distribution package
xutianming/DALI
A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications
xutianming/incubator-tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
xutianming/nccl
Optimized primitives for collective multi-GPU communication
xutianming/ray
A system for parallel and distributed Python that unifies the ML ecosystem.
xutianming/tensorflow
An Open Source Machine Learning Framework for Everyone
xutianming/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
xutianming/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
xutianming/Megatron-LM
Ongoing research training transformer models at scale
xutianming/playground
xutianming/Text-Classification-PyTorch
Implementation of papers for text classification task on SST-1/SST-2
xutianming/TurboTransformers
A fast and user-friendly runtime for transformer inference on CPU and GPU
xutianming/xutianming.github.com