All Projects → torch-model-compression → Similar Projects or Alternatives

332 Open source projects that are alternatives of or similar to torch-model-compression

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Stars: ✭ 1,232 (+877.78%)

Mutual labels: pruning, quantization, model-compression, onnx

sparsify

Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint

Stars: ✭ 138 (+9.52%)

Mutual labels: pruning, quantization, onnx

InsightFace-REST

InsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.

Stars: ✭ 308 (+144.44%)

Mutual labels: tensorrt, onnx, tensorrt-conversion

Kd lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Stars: ✭ 173 (+37.3%)

Mutual labels: pruning, quantization, model-compression

deepvac

PyTorch Project Specification.

Stars: ✭ 507 (+302.38%)

Mutual labels: quantization, tensorrt, onnx

Awesome Ai Infrastructures

Infrastructures™ for Machine Learning Training/Inference in Production.

Stars: ✭ 223 (+76.98%)

Mutual labels: pruning, quantization, model-compression

Paddleslim

PaddleSlim is an open-source library for deep model compression and architecture search.

Stars: ✭ 677 (+437.3%)

Mutual labels: pruning, quantization, model-compression

Awesome Ml Model Compression

Awesome machine learning model compression research papers, tools, and learning material.

Stars: ✭ 166 (+31.75%)

Mutual labels: pruning, quantization, model-compression

ATMC

[NeurIPS'2019] Shupeng Gui, Haotao Wang, Haichuan Yang, Chen Yu, Zhangyang Wang, Ji Liu, “Model Compression with Adversarial Robustness: A Unified Optimization Framework”

Stars: ✭ 41 (-67.46%)

Mutual labels: pruning, quantization, model-compression

Distiller

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

Stars: ✭ 3,760 (+2884.13%)

Mutual labels: pruning, quantization, onnx

neural-compressor

Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.

Stars: ✭ 666 (+428.57%)

Mutual labels: pruning, quantization, quantization-aware-training

Model Optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Stars: ✭ 992 (+687.3%)

Mutual labels: pruning, quantization, model-compression

Filter Pruning Geometric Median

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)

Stars: ✭ 338 (+168.25%)

Mutual labels: pruning, model-compression

MQBench Quantize

QAT(quantize aware training) for classification with MQBench

Stars: ✭ 29 (-76.98%)

Mutual labels: quantization, qat

Deepstream Project

This is a highly separated deployment project based on Deepstream , including the full range of Yolo and continuously expanding deployment projects such as Ocr.

Stars: ✭ 120 (-4.76%)

Mutual labels: tensorrt, onnx

DS-Net

(CVPR 2021, Oral) Dynamic Slimmable Network

Stars: ✭ 204 (+61.9%)

Mutual labels: pruning, model-compression

mediapipe plus

The purpose of this project is to apply mediapipe to more AI chips.

Stars: ✭ 38 (-69.84%)

Mutual labels: tensorrt, onnx

yolov5 tensorrt int8 tools

tensorrt int8 量化yolov5 onnx模型

Stars: ✭ 105 (-16.67%)

Mutual labels: tensorrt, onnx

sparsezoo

Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

Stars: ✭ 264 (+109.52%)

Mutual labels: pruning, quantization

Nncf

PyTorch*-based Neural Network Compression Framework for enhanced OpenVINO™ inference

Stars: ✭ 218 (+73.02%)

Mutual labels: pruning, quantization

AI-LAB

This repository contains a docker image that I use to develop my artificial intelligence applications in an uncomplicated fashion. Python, TensorFlow, PyTorch, ONNX, Keras, OpenCV, TensorRT, Numpy, Jupyter notebook... 🐋🔥

Stars: ✭ 44 (-65.08%)

Mutual labels: tensorrt, onnx

Torch Pruning

A pytorch pruning toolkit for structured neural network pruning and layer dependency maintaining.

Stars: ✭ 193 (+53.17%)

Mutual labels: pruning, model-compression

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Stars: ✭ 281 (+123.02%)

Mutual labels: quantization, onnx

YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Stars: ✭ 6,570 (+5114.29%)

Mutual labels: tensorrt, onnx

Pytorch Yolov4

PyTorch ,ONNX and TensorRT implementation of YOLOv4

Stars: ✭ 3,690 (+2828.57%)

Mutual labels: tensorrt, onnx

optimum

🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools

Stars: ✭ 567 (+350%)

Mutual labels: quantization, onnx

BitPack

BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Stars: ✭ 36 (-71.43%)

Mutual labels: quantization, model-compression

ZAQ-code

CVPR 2021 : Zero-shot Adversarial Quantization (ZAQ)

Stars: ✭ 59 (-53.17%)

Mutual labels: quantization, model-compression

Model compression

PyTorch Model Compression

Stars: ✭ 150 (+19.05%)

Mutual labels: pruning, quantization

Awesome Edge Machine Learning

A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and others.

Stars: ✭ 139 (+10.32%)

Mutual labels: pruning, quantization

fastT5

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

Stars: ✭ 421 (+234.13%)

Mutual labels: quantization, onnx

mtomo

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

Stars: ✭ 24 (-80.95%)

Mutual labels: tensorrt, onnx

vs-mlrt

Efficient ML Filter Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2, and Real-CUGAN)

Stars: ✭ 34 (-73.02%)

Mutual labels: tensorrt, onnx

bert-squeeze

🛠️ Tools for Transformers compression using PyTorch Lightning ⚡

Stars: ✭ 56 (-55.56%)

Mutual labels: pruning, quantization

Regularization-Pruning

[ICLR'21] PyTorch code for our paper "Neural Pruning via Growing Regularization"

Stars: ✭ 44 (-65.08%)

Mutual labels: pruning, model-compression

SSD-Pruning-and-quantization

Pruning and quantization for SSD. Model compression.

Stars: ✭ 19 (-84.92%)

Mutual labels: pruning, quantization

person-detection

TensorRT person tracking RFBNet300

Stars: ✭ 30 (-76.19%)

Mutual labels: tensorrt, onnx

onnx2tensorRt

tensorRt-inference darknet2onnx pytorch2onnx mxnet2onnx python version

Stars: ✭ 14 (-88.89%)

Mutual labels: tensorrt, onnx

ONNX-Runtime-with-TensorRT-and-OpenVINO

Docker scripts for building ONNX Runtime with TensorRT and OpenVINO in manylinux environment

Stars: ✭ 15 (-88.1%)

Mutual labels: tensorrt, onnx

Tengine

Tengine is a lite, high performance, modular inference engine for embedded device

Stars: ✭ 4,012 (+3084.13%)

Mutual labels: tensorrt, onnx

Ntagger

reference pytorch code for named entity tagging

Stars: ✭ 58 (-53.97%)

Mutual labels: pruning, quantization

Soft Filter Pruning

Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks

Stars: ✭ 291 (+130.95%)

Mutual labels: pruning, model-compression

Awesome Pruning

A curated list of neural network pruning resources.

Stars: ✭ 1,017 (+707.14%)

Mutual labels: pruning, model-compression

Tf2

An Open Source Deep Learning Inference Engine Based on FPGA

Stars: ✭ 113 (-10.32%)

Mutual labels: quantization, model-compression

Hawq

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Stars: ✭ 108 (-14.29%)

Mutual labels: quantization, model-compression

Awesome Automl And Lightweight Models

A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.

Stars: ✭ 691 (+448.41%)

Mutual labels: quantization, model-compression

Pinto model zoo

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

Stars: ✭ 634 (+403.17%)

Mutual labels: quantization, onnx

Pretrained Language Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Stars: ✭ 2,033 (+1513.49%)

Mutual labels: quantization, model-compression

SViTE

[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Stars: ✭ 50 (-60.32%)

Mutual labels: pruning, model-compression

Aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Stars: ✭ 453 (+259.52%)

Mutual labels: pruning, quantization

Awesome Emdl

Embedded and mobile deep learning research resources

Stars: ✭ 554 (+339.68%)

Mutual labels: pruning, quantization

nanodet tensorrt int8

nanodet int8 量化，实测推理2ms一帧！

Stars: ✭ 37 (-70.63%)

Mutual labels: tensorrt

Tengine-Convert-Tools

Tengine Convert Tool supports converting multi framworks' models into tmfile that suitable for Tengine-Lite AI framework.

Stars: ✭ 89 (-29.37%)

Mutual labels: onnx

Auto-Compression

Automatic DNN compression tool with various model compression and neural architecture search techniques

Stars: ✭ 19 (-84.92%)

Mutual labels: model-compression

torchprune

A research library for pytorch-based neural network pruning, compression, and more.

Stars: ✭ 133 (+5.56%)

Mutual labels: pruning

AgentOCR

一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.

Stars: ✭ 98 (-22.22%)

Mutual labels: onnx

yolov5 tensorrt

This is the implementation that supports yolov5s, yolov5m, yolov5l, yolov5x.

Stars: ✭ 32 (-74.6%)

Mutual labels: tensorrt

TF2DeepFloorplan

TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.

Stars: ✭ 98 (-22.22%)

Mutual labels: quantization

Selecsls Pytorch

Reference ImageNet implementation of SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On implicit filter level sparsity in Convolutional Neural Networks".

Stars: ✭ 251 (+99.21%)

Mutual labels: pruning

GAN-LTH

[ICLR 2021] "GANs Can Play Lottery Too" by Xuxi Chen, Zhenyu Zhang, Yongduo Sui, Tianlong Chen

Stars: ✭ 24 (-80.95%)

Mutual labels: pruning

1-60 of 332 similar projects

›

next*5