hustzxd

PhD from the Institute of Computing Technology (ICT), University of Chinese Academy of Sciences (UCAS).

AMD, Beijing

Pinned Repositories

ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
Language: Python | 274 17 2656
blog
my blog backup
Language: Stylus | 3 1 03
EagleEyeEFF
Channel pruning implemented with the latest Torch.FX feature, plus a reimplementation of EagleEye
Language: Python | 6 1 01
EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
Language: Jupyter Notebook | 32 2 511
LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
Language: Jupyter Notebook | 125 5 921
PaperListTemplate
This template makes it easy for you to manage papers.
Language: Python | 2 2 01
QuanOview
Quantization Overview (an incomplete summary)
Language: TeX | 8 2 01
z0
Language: Makefile | 27 2 711
z1
Language: Python | 38 4 1024
EfficientPaperList
Paper about Pruning, Quantization, and Efficient-inference/training.
Language: Python | 3 0 00

hustzxd's Repositories

hustzxd/LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
Language: Jupyter Notebook | 125 5 921
hustzxd/EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
Language: Jupyter Notebook | 32 2 511
hustzxd/EagleEyeEFF
Channel pruning implemented with the latest Torch.FX feature, plus a reimplementation of EagleEye
Language: Python | 6 1 01
hustzxd/blog
my blog backup
Language: Stylus | 3 1 03
hustzxd/PaperListTemplate
This template makes it easy for you to manage papers.
Language: Python | 2 2 01
hustzxd/examples-run
A set of examples around pytorch in Vision with TRAINING BASH.
Language: Python | 1 2 0
hustzxd/ABCPruner
Pytorch implementation of our paper accepted by IJCAI 2020 -- Channel Pruning via Automatic Structure Search
Language: Python | 1 0
hustzxd/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Language: Python | 1 0
hustzxd/ASKs
Asks: Convolution with any-shape kernels for efficient neural networks (Neurocomputing.2021)
Language: Python | 0 0
hustzxd/attention-is-all-you-need-paper
Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Language: Jupyter Notebook | 0 0
hustzxd/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
0 0
hustzxd/awesome-image-transformer
List of all the papers on Transformers for Vision.
1 0
hustzxd/BitSplit
BitSplit Post-training Quantization
Language: Python | 1 0
hustzxd/Dynamic-convolution-Pytorch
Pytorch!!!Pytorch!!!Pytorch!!! Dynamic Convolution: Attention over Convolution Kernels (CVPR-2020)
Language: Python | 1 0
hustzxd/dynamic-pruning
Language: Python | 1 0
hustzxd/EagleEye
(ECCV'2020 Oral)EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning
Language: Python | 0 0
hustzxd/hustzxd
2 0
hustzxd/ictlogin
Language: Python | 0 0
hustzxd/litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
Language: Python | 0 0
hustzxd/llama
Inference code for LLaMA models
Language: Python | 0 0
hustzxd/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python | 0 0
hustzxd/MQBench
Model Quantization Benchmark
Language: Python | 1 0
hustzxd/pytorch-cifar
95.47% on CIFAR10 with PyTorch
Language: Python | 0 0
hustzxd/pytorch-cifar-models
Pretrained models on CIFAR10/100 in PyTorch
Language: Python | 0 0
hustzxd/rocmstat
📊 A simple command-line utility for querying and monitoring GPU status
Language: Python | 0 0
hustzxd/simplenote-android
Simplenote for Android
Language: Java | 0 0
hustzxd/supermariopy
Python library, scripts, and notebooks that are useful from time to time
Language: Python | 0 0
hustzxd/triton
Development repository for the Triton language and compiler
Language: C++ | 0 0
hustzxd/tutorials
PyTorch tutorials.
Language: Python | 0 0
hustzxd/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language: Python | 0 0