BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Last update: Dec 02, 2022

Overview

BitPack

BitPack is a practical tool that can efficiently save quantized neural network models with mixed bitwidth.

Installation

PyTorch version >= 1.4.0
Python version >= 3.5
To install BitPack, simply run:

git clone https://github.com/Zhen-Dong/BitPack.git
cd BitPack

Usage

Use pack.py to save integer checkpoints with various bitwidths, and unpack.py to load the packed checkpoint, as shown in the Quick Start demo.
To pack integer values that are stored in floating-point format, add --force-pack-fp to the command.
To save a packed checkpoint directly from PyTorch, use save_quantized_state_dict() and load_quantized_state_dict() in pytorch_interface.py. If you do not want to operate on the whole state_dict at once, the code inside the for loop of those two functions can be applied to each quantized tensor (ultra-low-precision integer tensor) individually, in any quantization framework.
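
To illustrate the general idea of packing low-bitwidth integers, here is a minimal sketch in plain Python. It is not BitPack's actual implementation: the function names pack_ints and unpack_ints are hypothetical, and the sketch assumes unsigned values of at most 8 bits stored in little-endian bit order. Signed quantized weights would first need to be shifted into an unsigned range.

```python
def pack_ints(values, bitwidth):
    """Pack unsigned integers of `bitwidth` bits each into a bytes object."""
    if not 1 <= bitwidth <= 8:
        raise ValueError("bitwidth must be between 1 and 8")
    limit = 1 << bitwidth
    buffer = 0   # bit accumulator
    nbits = 0    # number of valid bits currently in the accumulator
    out = bytearray()
    for v in values:
        if not 0 <= v < limit:
            raise ValueError(f"{v} does not fit in {bitwidth} bits")
        buffer |= v << nbits
        nbits += bitwidth
        while nbits >= 8:            # emit every completed byte
            out.append(buffer & 0xFF)
            buffer >>= 8
            nbits -= 8
    if nbits:
        out.append(buffer & 0xFF)    # flush the final partial byte
    return bytes(out)


def unpack_ints(packed, bitwidth, count):
    """Recover `count` integers of `bitwidth` bits each from `packed`."""
    mask = (1 << bitwidth) - 1
    buffer = 0
    nbits = 0
    values = []
    for byte in packed:
        buffer |= byte << nbits
        nbits += 8
        while nbits >= bitwidth and len(values) < count:
            values.append(buffer & mask)
            buffer >>= bitwidth
            nbits -= bitwidth
    return values


weights = [3, 0, 1, 2, 3]                         # five 2-bit values
packed = pack_ints(weights, 2)                    # stored in 2 bytes
restored = unpack_ints(packed, 2, len(weights))   # equal to `weights`
```

The element count has to be stored alongside the packed bytes, because padding bits in the last byte are otherwise indistinguishable from real values.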

Quick Start

BitPack is easy to use with various quantization frameworks. The demo below applies BitPack to save a mixed-precision model generated by HAWQ.

export CUDA_VISIBLE_DEVICES=0
python pack.py --input-int-file quantized_checkpoint.pth.tar --force-pack-fp
python unpack.py --input-packed-file packed_quantized_checkpoint.pth.tar --original-int-file quantized_checkpoint.pth.tar

To get a better sense of how BitPack works, we provide a simple test that compares the original tensor, the packed tensor, and the unpacked tensor in detail.

cd bitpack
python bitpack_utils.py

Results of BitPack on ResNet50

Original Precision	Quantization	Original Size (MB)	Packed Size (MB)	Compression Ratio
Floating Point	Mixed-Precision (4-bit/8-bit)	102	13.8	7.4x
8-bit	Mixed-Precision (2-bit/8-bit)	26	7.9	3.3x
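
The compression ratios in the table are the original size divided by the packed size. The sketch below reproduces that arithmetic and also estimates an ideal packed size from per-layer bitwidths. The helper names and the two-layer example are hypothetical, and the estimate ignores any metadata or per-tensor padding a real checkpoint would carry.

```python
def compression_ratio(original_mb, packed_mb):
    """Ratio of original checkpoint size to packed checkpoint size."""
    return original_mb / packed_mb


def ideal_packed_bytes(layers):
    """Lower bound on packed size for (num_params, bitwidth) pairs."""
    total_bits = sum(num_params * bitwidth for num_params, bitwidth in layers)
    return (total_bits + 7) // 8   # round up to whole bytes


# Sizes taken from the table above.
ratio_fp = round(compression_ratio(102, 13.8), 1)   # 7.4
ratio_int8 = round(compression_ratio(26, 7.9), 1)   # 3.3

# Hypothetical model: 1000 parameters at 4 bits, 500 parameters at 8 bits.
example_bytes = ideal_packed_bytes([(1000, 4), (500, 8)])   # 1000
```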

Special Notes

unpack.py can be used for checking correctness. It loads and unpacks the packed model, and then compares it with the original model.
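
A correctness check of this kind can be sketched as follows. This is an illustration only, not the logic of unpack.py: find_mismatches is a hypothetical helper, and the checkpoints are assumed to be plain dicts mapping parameter names to sequences of integers. With real PyTorch state dicts, one would likely compare tensors with torch.equal instead.

```python
def find_mismatches(original, unpacked):
    """Return the sorted names of entries that differ between two checkpoints."""
    mismatched = []
    for name in sorted(set(original) | set(unpacked)):
        if name not in original or name not in unpacked:
            mismatched.append(name)          # entry missing on one side
        elif list(original[name]) != list(unpacked[name]):
            mismatched.append(name)          # values differ
    return mismatched


original = {"conv1.weight": [1, 2, 3], "fc.weight": [0, 1]}
round_trip_ok = {"conv1.weight": [1, 2, 3], "fc.weight": [0, 1]}
round_trip_bad = {"conv1.weight": [1, 2, 3], "fc.weight": [0, 2]}

ok_result = find_mismatches(original, round_trip_ok)     # []
bad_result = find_mismatches(original, round_trip_bad)   # ["fc.weight"]
```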

License

BitPack is released under the MIT license.

Owner

Zhen Dong
