hustzxd
PhD, Institute of Computing Technology (ICT), University of Chinese Academy of Sciences (UCAS).
AMD, Beijing
Pinned Repositories
ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
blog
my blog backup
EagleEyeEFF
Channel pruning implemented with the latest Torch.FX feature, plus an EagleEye reimplementation
EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
LSQuantization
An unofficial PyTorch implementation of Learned Step Size Quantization (LSQ), ICLR 2020
PaperListTemplate
This template makes it easy for you to manage papers.
QuanOview
Quantization Overview: a non-exhaustive summary
z0
z1
EfficientPaperList
Paper about Pruning, Quantization, and Efficient-inference/training.
hustzxd's Repositories
hustzxd/LSQuantization
An unofficial PyTorch implementation of Learned Step Size Quantization (LSQ), ICLR 2020
hustzxd/EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
hustzxd/EagleEyeEFF
Channel pruning implemented with the latest Torch.FX feature, plus an EagleEye reimplementation
hustzxd/blog
my blog backup
hustzxd/PaperListTemplate
This template makes it easy for you to manage papers.
hustzxd/examples-run
A set of PyTorch vision examples with Bash training scripts.
hustzxd/ABCPruner
PyTorch implementation of our paper accepted by IJCAI 2020: Channel Pruning via Automatic Structure Search
hustzxd/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
hustzxd/ASKs
ASKs: Convolution with any-shape kernels for efficient neural networks (Neurocomputing, 2021)
hustzxd/attention-is-all-you-need-paper
Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
hustzxd/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
hustzxd/awesome-image-transformer
List of all the papers on Transformers for Vision.
hustzxd/BitSplit
BitSplit Post-training Quantization
hustzxd/Dynamic-convolution-Pytorch
PyTorch implementation of Dynamic Convolution: Attention over Convolution Kernels (CVPR 2020)
hustzxd/dynamic-pruning
hustzxd/EagleEye
(ECCV 2020 Oral) EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning
hustzxd/hustzxd
hustzxd/ictlogin
hustzxd/litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
hustzxd/llama
Inference code for LLaMA models
hustzxd/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
hustzxd/MQBench
Model Quantization Benchmark
hustzxd/pytorch-cifar
95.47% on CIFAR10 with PyTorch
hustzxd/pytorch-cifar-models
Pretrained models on CIFAR10/100 in PyTorch
hustzxd/rocmstat
📊 A simple command-line utility for querying and monitoring GPU status
hustzxd/simplenote-android
Simplenote for Android
hustzxd/supermariopy
Python library, scripts, and notebooks that are useful from time to time
hustzxd/triton
Development repository for the Triton language and compiler
hustzxd/tutorials
PyTorch tutorials.
hustzxd/tvm
Open deep learning compiler stack for CPU, GPU, and specialized accelerators