Pinned Repositories
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
oneDNN
oneAPI Deep Neural Network Library (oneDNN)
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
torchao-fork
The torchao repository contains api's and workflows for quantization and pruning gpu models.
torchutils
Torch helper functions
yiliu30's Repositories
yiliu30/ai-research
yiliu30/awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
yiliu30/baidu-allreduce
yiliu30/BRECQ
Pytorch implementation of BRECQ, ICLR 2021
yiliu30/brpc
brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" means "better RPC".
yiliu30/CodeXGLUE
CodeXGLUE
yiliu30/cs344
Introduction to Parallel Programming class code
yiliu30/dist_tuto.pth
Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial
yiliu30/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
yiliu30/extension-cpp
C++ extensions in PyTorch
yiliu30/former
Simple transformer implementation from scratch in pytorch.
yiliu30/HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
yiliu30/leveldb
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
yiliu30/Machine-Learning-Collection
A resource for learning about Machine learning & Deep Learning
yiliu30/mpitutorial
MPI programming lessons in C and executable code examples
yiliu30/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
yiliu30/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
yiliu30/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
yiliu30/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
yiliu30/patternlets
Minimalist, scalable, working programs that illustrate parallel design patterns in MPI, OpenMP, Pthreads, ...
yiliu30/pytorch-Deep-Learning
Deep Learning (with PyTorch)
yiliu30/pytorch.github.io
The website for PyTorch
yiliu30/smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
yiliu30/Test
yiliu30/tvm_mlir_learn
tvm learn