pruning
There are 432 repositories under the pruning topic.
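Most of the libraries listed below build on one core idea: zeroing out the weights with the smallest magnitudes. A minimal NumPy sketch of unstructured magnitude pruning (the function and variable names here are illustrative, not taken from any listed library):

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the `sparsity` fraction of weights with the smallest |w|."""
    threshold = np.quantile(np.abs(weights), sparsity)
    mask = np.abs(weights) > threshold  # keep only weights above the threshold
    return weights * mask

rng = np.random.default_rng(0)
w = rng.normal(size=(64, 64))
pruned = magnitude_prune(w, sparsity=0.9)
print(f"sparsity achieved: {(pruned == 0).mean():.2f}")
```

The resulting tensor keeps its original shape; speedups come only when the runtime (e.g. a sparsity-aware engine such as DeepSparse, listed below) can exploit the zeros.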
datawhalechina/leedl-tutorial
The Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍); PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
IntelLabs/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
VainF/Torch-Pruning
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
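Structural pruning, as in Torch-Pruning above, removes whole channels so the network physically shrinks rather than just becoming sparse. A hedged NumPy sketch of the channel-ranking step (L2-norm criterion; names are our own, not the library's API):

```python
import numpy as np

def select_channels(conv_weight: np.ndarray, keep_ratio: float) -> np.ndarray:
    """Rank output channels of a conv weight (out, in, kh, kw) by L2 norm
    and return the sorted indices of the channels to keep."""
    flat = conv_weight.reshape(conv_weight.shape[0], -1)
    norms = np.linalg.norm(flat, axis=1)
    n_keep = max(1, int(round(conv_weight.shape[0] * keep_ratio)))
    return np.sort(np.argsort(norms)[-n_keep:])  # indices of largest-norm channels

rng = np.random.default_rng(0)
w = rng.normal(size=(16, 8, 3, 3))  # 16 output channels
keep = select_channels(w, keep_ratio=0.5)
w_pruned = w[keep]                  # physically smaller tensor
print(w_pruned.shape)               # (8, 8, 3, 3)
```

The hard part that libraries like Torch-Pruning automate is propagating this choice: the next layer's input channels, any batch-norm parameters, and residual connections must all be sliced consistently.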
he-y/Awesome-Pruning
A curated list of neural network pruning resources.
666DZY666/micronet
micronet, a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa; "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference") and low-bit (≤2b) ternary/binary (TWN/BNN/XNOR-Net), plus post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, regular, and group convolutional channel pruning; (3) group convolution structure; (4) batch-normalization fusion for quantization. Deployment: TensorRT, FP32/FP16/INT8 (PTQ calibration), op adaptation (upsample), dynamic shape.
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
neuralmagic/sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
quic/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
PaddlePaddle/PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
tensorflow/model-optimization
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
open-mmlab/mmrazor
OpenMMLab Model Compression Toolbox and Benchmark.
cupcakearmy/autorestic
Config-driven, easy backup CLI for restic.
huawei-noah/Efficient-Computing
Efficient computing methods developed by Huawei Noah's Ark Lab
jacobgil/pytorch-pruning
PyTorch implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
openvinotoolkit/nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
Syencil/mobile-yolov5-pruning-distillation
MobileNetV2-YOLOv5s pruning and distillation, with ncnn and TensorRT deployment support. Ultra-light but with better performance!
csarron/awesome-emdl
Embedded and mobile deep learning research resources
alibaba/TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
horseee/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
SforAiDl/KD_Lib
A PyTorch knowledge distillation library for benchmarking and extending work in the domains of knowledge distillation, pruning, and quantization.
he-y/filter-pruning-geometric-median
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
cedrickchee/awesome-ml-model-compression
Awesome machine learning model compression research papers, tools, and learning material.
SpursLipu/YOLOv3v4-ModelCompression-MultidatasetTraining-Multibackbone
YOLO model compression with multi-dataset training.
BenWhetton/keras-surgeon
Pruning and other network surgery for trained Keras models.
he-y/soft-filter-pruning
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
1duo/awesome-ai-infrastructures
Infrastructures™ for Machine Learning Training/Inference in Production.
neuralmagic/sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
airaria/TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
talebolano/yolov3-network-slimming
An implementation of network slimming (pruning) for YOLOv3.
huggingface/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
mehtadushy/SelecSLS-Pytorch
Reference ImageNet implementation of SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On implicit filter level sparsity in Convolutional Neural Networks".
megvii-research/Sparsebit
A model compression and acceleration toolbox based on pytorch.
rahulvigneswaran/Lottery-Ticket-Hypothesis-in-Pytorch
This repository contains a PyTorch implementation of the paper "The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks" by Jonathan Frankle and Michael Carbin that can be easily adapted to any model/dataset.
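The lottery-ticket procedure is iterative: train, prune the smallest surviving weights, rewind the survivors to their initialization values, and repeat. A hedged NumPy sketch of one round (training is mocked with noise; names are illustrative, not from the repository above):

```python
import numpy as np

def lottery_ticket_round(w_init, w_trained, mask, prune_frac):
    """One iterative-magnitude-pruning round: extend the mask by pruning
    the `prune_frac` smallest surviving trained weights, then rewind the
    survivors to their initialization values."""
    alive = np.abs(w_trained[mask])
    threshold = np.quantile(alive, prune_frac)
    new_mask = mask & (np.abs(w_trained) > threshold)
    return w_init * new_mask, new_mask

rng = np.random.default_rng(0)
w_init = rng.normal(size=(100,))
mask = np.ones_like(w_init, dtype=bool)
w_trained = w_init + 0.1 * rng.normal(size=(100,))  # stand-in for training
w_rewound, mask = lottery_ticket_round(w_init, w_trained, mask, prune_frac=0.2)
print(mask.sum())  # ~80 weights survive the first round
```

Because the mask is applied to `w_init` rather than `w_trained`, the surviving subnetwork restarts from its original initialization, which is the hypothesis's key ingredient.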
neuralmagic/sparsify
ML model optimization product to accelerate inference.