[ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"

Last update: Nov 20, 2022

Overview

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

By Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao.

This is the pytorch implementation of our paper "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks", published in ICCV 2021.

Citation

If you find our code useful for your research, please consider citing:

@inproceedings{wang2021snn,
    title={Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks},
    author={Wang, Yikai and Yang, Yi and Sun, Fuchun and Yao, Anbang},
    booktitle={International Conference on Computer Vision (ICCV)},
    year={2021}
}

Dataset

Following this repository,

Download the ImageNet dataset from http://www.image-net.org.
Then move validation images to labeled subfolders, using the following script.

Requirements:

python3, pytorch 1.4.0, torchvision 0.5.0

Training

(1) Step1: binarizing activations (or you can omit this step by using our Step1 model checkpoint_ba.pth.tar),

Change directory to ./step1,
Run the folowing script,

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --data=path/to/ILSVRC2012/  --batch_size=512 --learning_rate=1e-3 --epochs=256 --weight_decay=1e-5

(2) Step2: binarizing weights + activations,

Change directory to ./step2,
Create new folder ./models and copy checkpoint_ba.pth.tar (obtained from Step1) to ./models,
Run the folowing script,

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --data=path/to/ILSVRC2012/  --batch_size=512 --learning_rate=1e-3 --epochs=256 --weight_decay=0 --bit-num=5

Comment: --bit-num=5 corresponds to 0.56 bit (bit-num indicates tau in the paper).

Results

This implementation is based on ResNet-18 of ReActNet.

Bit-Width	Top1-Acc	Top5-Acc	#Params	Bit-OPs	Model & Log
1W / 1A	65.7%	86.3%	10.99Mbit	1.677G	Google Drive
0.67W / 1A	63.4%	84.5%	7.324Mbit	0.883G	Google Drive
0.56W / 1A	62.1%	83.8%	6.103Mbit	0.501G	Google Drive
0.44W / 1A	60.7%	82.7%	4.882Mbit	0.297G	Google Drive

License

SNN is released under MIT License.

[ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"

Related tags

Overview

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

Citation

Dataset

Requirements:

Training

Results

License

Owner

Yikai Wang

PyTorch implementation of Soft-DTW: a Differentiable Loss Function for Time-Series in CUDA

Code for 1st place solution in Sleep AI Challenge SNU Hospital

Leveraging OpenAI's Codex to solve cornerstone problems in Music

OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Boundary'

Implementation of Multistream Transformers in Pytorch

Make Watson Assistant send messages to your Discord Server

ML models and internal tensors 3D visualizer

Honours project, on creating a depth estimation map from two stereo images of featureless regions

To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beginners, intermediates as well as experts

【steal piano】GitHub偷情分析工具！

Pytorch Implementation of PointNet and PointNet++++

Randstad Artificial Intelligence Challenge (powered by VGEN). Soluzione proposta da Stefano Fiorucci (anakin87) - primo classificato

Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"

a Pytorch easy re-implement of "YOLOX: Exceeding YOLO Series in 2021"

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

Template repository for managing machine learning research projects built with PyTorch-Lightning

Pytorch Lightning Distributed Accelerators using Ray

Open-source Monocular Python HawkEye for Tennis