Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Last update: Dec 30, 2022

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Pytorch implementation of Neuron Merging: Compensating for Pruned Neurons, accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020).

Requirements

To install requirements:

conda env create -f ./environment.yml

Python environment & main libraries:

python 3.8
pytorch 1.5.0
scikit-learn 0.22.1
torchvision 0.6.0

LeNet-300-100

To test LeNet-300-100 model on FashionMNIST, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script:

model type: original | prune | merge
pruning criterion : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

For example, to test the model after pruning 50% of the neurons with $l_1$-norm criterion, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t prune -c l1-norm -r 0.5

To test the model after merging , run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t merge -c l1-norm -r 0.5

VGG-16

To test VGG-16 model on CIFAR-10, run:

bash scripts/VGG16_CIFAR10.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

As a pretrained model on CIFAR-100 is not included, you must train it first. To train VGG-16 on CIFAR-100, run:

bash scripts/VGG16_CIFAR100_train.sh

All the hyperparameters are as described in the supplementary material.

After training, to test VGG-16 model on CIFAR-100, run:

bash scripts/VGG16_CIFAR100.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

ResNet

To test ResNet-56 model on CIFAR-10, run:

bash scripts/ResNet56_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

To test WideResNet-40-4 model on CIFAR-10, run:

bash scripts/WideResNet_40_4_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

Results

Our model achieves the following performance on (without fine-tuning) :

Image classification of LeNet-300-100 on FashionMNIST

Baseline Accuracy : 89.80%

Pruning Ratio	Prune ($l_1$-norm)	Merge
50%	88.40%	88.69%
60%	85.17%	86.92%
70%	71.26%	82.75%
80%	66.76	80.02%

Image classification of VGG-16 on CIFAR-10

Baseline Accuracy : 93.70%

Criterion	Prune	Merge
$l_1$-norm	88.70%	93.16%
$l_2$-norm	89.14%	93.16%
$l_2$-GM	87.85%	93.10%

Citation

@inproceedings{kim2020merging,
  title     = {Neuron Merging: Compensating for Pruned Neurons},
  author    = {Kim, Woojeong and Kim, Suhyun and Park, Mincheol and Jeon, Geonseok},
  booktitle = {Advances in Neural Information Processing Systems 33},
  year      = {2020}
}

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Requirements

LeNet-300-100

VGG-16

ResNet

Results

Image classification of LeNet-300-100 on FashionMNIST

Image classification of VGG-16 on CIFAR-10

Citation

Owner

Woojeong Kim

An experimental technique for efficiently exploring neural architectures.

The mini-MusicNet dataset

The reference baseline of final exam for XMU machine learning course

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

Deep motion transfer

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

PyTorch implementation of the Flow Gaussian Mixture Model (FlowGMM) model from our paper

An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

The 2nd place solution of 2021 google landmark retrieval on kaggle.

Probabilistic Cross-Modal Embedding (PCME) CVPR 2021

An end-to-end framework for mixed-integer optimization with data-driven learned constraints.

Code for the preprint "Well-classified Examples are Underestimated in Classification with Deep Neural Networks"

Near-Duplicate Video Retrieval with Deep Metric Learning

PyTorch implementations of the NeRF model described in "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis"

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings

Image-to-Image Translation in PyTorch

General Multi-label Image Classification with Transformers

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

PyTorch implementation of Deformable Convolution