THU-MIG / torch-model-compression

Licence: MIT license

针对pytorch模型的自动化模型结构分析和修改工具集，包含自动分析模型结构的模型压缩算法库

Programming Languages

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to torch-model-compression

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Stars: ✭ 1,232 (+877.78%)

Mutual labels: pruning, quantization, model-compression, onnx

InsightFace-REST

InsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.

Stars: ✭ 308 (+144.44%)

Mutual labels: tensorrt, onnx, tensorrt-conversion

Awesome Ml Model Compression

Awesome machine learning model compression research papers, tools, and learning material.

Stars: ✭ 166 (+31.75%)

Mutual labels: pruning, quantization, model-compression

Kd lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Stars: ✭ 173 (+37.3%)

Mutual labels: pruning, quantization, model-compression

deepvac

PyTorch Project Specification.

Stars: ✭ 507 (+302.38%)

Mutual labels: quantization, tensorrt, onnx

neural-compressor

Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.

Stars: ✭ 666 (+428.57%)

Mutual labels: pruning, quantization, quantization-aware-training

Model Optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Stars: ✭ 992 (+687.3%)

Mutual labels: pruning, quantization, model-compression

Paddleslim

PaddleSlim is an open-source library for deep model compression and architecture search.

Stars: ✭ 677 (+437.3%)

Mutual labels: pruning, quantization, model-compression

sparsify

Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint

Stars: ✭ 138 (+9.52%)

Mutual labels: pruning, quantization, onnx

ATMC

[NeurIPS'2019] Shupeng Gui, Haotao Wang, Haichuan Yang, Chen Yu, Zhangyang Wang, Ji Liu, “Model Compression with Adversarial Robustness: A Unified Optimization Framework”

Stars: ✭ 41 (-67.46%)

Mutual labels: pruning, quantization, model-compression

Distiller

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

Stars: ✭ 3,760 (+2884.13%)

Mutual labels: pruning, quantization, onnx

Awesome Ai Infrastructures

Infrastructures™ for Machine Learning Training/Inference in Production.

Stars: ✭ 223 (+76.98%)

Mutual labels: pruning, quantization, model-compression

Ntagger

reference pytorch code for named entity tagging

Stars: ✭ 58 (-53.97%)

Mutual labels: pruning, quantization

Awesome Edge Machine Learning

A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and others.

Stars: ✭ 139 (+10.32%)

Mutual labels: pruning, quantization

Awesome Pruning

A curated list of neural network pruning resources.

Stars: ✭ 1,017 (+707.14%)

Mutual labels: pruning, model-compression

Model compression

PyTorch Model Compression

Stars: ✭ 150 (+19.05%)

Mutual labels: pruning, quantization

sparsezoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

Stars: ✭ 264 (+109.52%)

Mutual labels: pruning, quantization

Torch Pruning

A pytorch pruning toolkit for structured neural network pruning and layer dependency maintaining.

Stars: ✭ 193 (+53.17%)

Mutual labels: pruning, model-compression

Nncf

PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference

Stars: ✭ 218 (+73.02%)

Mutual labels: pruning, quantization

BitPack

BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Stars: ✭ 36 (-71.43%)

Mutual labels: quantization, model-compression

View All Similar Projects ➔

pytorch自动化模型压缩工具库

介绍

pytorch自动化模型压缩工具库是针对于pytorch模型的基于ONNX静态图结构分析的自动化模型压缩工具库，用户无需理解模型的结构，即可以直接对模型完成剪枝和查找替换等操作，所有的剪枝参数分析和修改以及模块结构搜索均由该工具完成。
工具库包含有两个部分，第一个部分为torchpruner模型分析与修改工具库，通过该工具库可以自动对模型完成模块修改和通道修改等常规操作，而无需关注模型的结构细节。
第二个部分为torchslim模型压缩算法库，包含了模型重参数化、剪枝、感知量化训练等多种模型压缩算法，用户仅需给出需要被压缩的模型并定义好用于训练的hook函数，即可以对模型进行自动压缩，并输出被压缩模型产物。

requirement

onnx>=1.6
onnxruntime>=1.5
pytorch>=1.7
tensorboardX>=1.8
scikit-learn

安装

python setup.py install

总揽

torchpruner

torchpruner为pytorch模型分析与修改工具库，包含了以下功能：
1）自动分析模型结构和剪枝
2）特定模型结构查找与替换

torchslim

torchslim内包含了模型压缩的特定算法：
1）ACNet、CnC、ACBCorner等一系列重参数化方法 2）ResRep模型剪枝方法
3）CSGD模型剪枝方法
4）QAT量化感知训练，并将pytorch模型导出为tenosrrt模型

examples

examples文件夹主要包含了多种支持的模型的测试列表support_model，torchpruner工具库的使用示例以及torchslim工具库的使用示例。
1）support_model：支持的若干种模型
2）torchpruner：使用torchpruner剪枝和模块修改的简单示例
3）torchslim：使用torchslim 在分类模型上的简单示例

torchpruner模型修改

import torch
import torchpruner
import torchvision
#加载模型
model=torchvision.models.resnet50()

#创建ONNXGraph对象，绑定需要被剪枝的模型
graph=torchpruner.ONNXGraph(model)
##build ONNX静态图结构，需要指定输入的张量
graph.build_graph(inputs=(torch.zeros(1,3,224,224),))

#获取conv1模块对应的module
conv1_module=graph.modules['self.conv1']
#剪枝分析
result=conv1_module.cut_analysis(attribute_name='weight',index=[0,1,2,3],dim=0)
#执行剪枝操作
model,context=torchpruner.set_cut(model,result)
#对卷积模块进行剪枝操作

torchslim模型压缩

import torchslim

#predict_function的第一个参数为model，第二个参数为一个batch的data,data已经被放置到了GPU上
config['task_name']='resnet56_prune'
config['epoch']=90
config['lr']=0.1
config['prune_rate']=0.5
config['save_path']="model/save/path"
config['dataset_generator']=dataset_generator
config['predict_function']=predict_function
config['evaluate_function']=evaluate_function
config['calculate_loss_function']=calculate_loss_function

model=torch.load("model/path")

#创建solver
solver=torchslim.pruning.resrep.ResRepSolver(model,config)
#执行压缩
solver.run()

使用说明

常规用法见
examples
详细使用说明见各自文件夹README.md
torchpruner
torchslim

支持模型结构

该工具理论上支持所有复杂结构模型的剪枝操作，然而由于精力有限，仅有部分的模型和结构被测试，其他模型和结构不代表不支持，但未测试。

已测试常用模型

AlexNet
VGGNet系列
ResNet系列
MobileNet系列
ShuffleNet系列
Inception系列
MNASNet系列
Unet系列
FCN
DeepLab V3
ResNet/Unet QAT感知量化训练模型(QDQ节点)

已测试常用结构和操作

Conv/Group Conv/TransposeConv/FC
Pooling/Upsampling
BatchNorm
Relu/Sigmoid
concat/transpose/view
残差结构/倒置残差结构/Inception结构/Unet结构
quantize_per_tensor/dequantize_per_tensor

确定暂不支持模型

FasterRCNN/MaskRCNN

未来重点测试和支持的模型和结构

RNN/LSTM/GRU
Transformer
FasterRCNN

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

THU-MIG / torch-model-compression

Programming Languages

Labels

Projects that are alternatives of or similar to torch-model-compression

pytorch自动化模型压缩工具库

介绍

requirement

安装

总揽

torchpruner

torchslim

examples

torchpruner模型修改

torchslim模型压缩

使用说明

支持模型结构

已测试常用模型

已测试常用结构和操作

确定暂不支持模型

未来重点测试和支持的模型和结构