Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference", CVPR 2021

Last update: Dec 26, 2022

Overview

SMSR

Repository for "Exploring Sparsity in Image Super-Resolution for Efficient Inference"

[arXiv]

Highlights

Locate and skip redundant computation in SR networks at a fine-grained level for efficient inference.
Maintain state-of-the-art performance with significant FLOPs reduction and a speedup on mobile devices.
Efficient implementation of sparse convolution based on native PyTorch APIs for easier migration and deployment.

Network Architecture

Implementation of Sparse Convolution

For easier migration and deployment, we use an efficient implementation of sparse convolution based on native PyTorch APIs rather than the commonly used CUDA-based implementation. Specifically, sparse features are first extracted from the input, as shown in the following figure. Then, matrix multiplication is executed to produce the output features.
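The gather-then-multiply idea described above can be sketched as follows. This is a minimal NumPy illustration, not the repository's PyTorch code: the function names `dense_conv3x3` and `sparse_conv3x3` are hypothetical, and it assumes a single image, a 3x3 kernel with stride 1 and zero padding, and a spatial mask only.

```python
import numpy as np

def dense_conv3x3(x, w):
    """Reference dense 3x3 convolution (stride 1, zero padding 1).
    x: (C_in, H, W), w: (C_out, C_in, 3, 3) -> (C_out, H, W)."""
    c_in, h, wd = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    # im2col: one column of C_in*9 values for every output position
    cols = np.stack([xp[:, i:i + h, j:j + wd]
                     for i in range(3) for j in range(3)], axis=1)
    cols = cols.reshape(c_in * 9, h * wd)
    return (w.reshape(w.shape[0], c_in * 9) @ cols).reshape(-1, h, wd)

def sparse_conv3x3(x, w, mask):
    """Same convolution, computed only where mask (H, W) is True."""
    c_in, h, wd = x.shape
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    rows, cols = np.nonzero(mask)          # active output positions
    n = len(rows)
    # Step 1: extract sparse features (3x3 patches at active positions only)
    patches = np.stack([xp[:, rows + i, cols + j]
                        for i in range(3) for j in range(3)], axis=1)
    patches = patches.reshape(c_in * 9, n)
    # Step 2: a single matrix multiplication produces the active outputs
    active_out = w.reshape(w.shape[0], c_in * 9) @ patches
    # Step 3: scatter the results back; skipped positions stay zero
    out = np.zeros((w.shape[0], h, wd), dtype=active_out.dtype)
    out[:, rows, cols] = active_out
    return out
```

In this sketch the matrix multiplication shrinks from H*W columns to the number of active positions, which is where the saving comes from; at active positions the result matches the dense convolution.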

Requirements

Python 3.6
PyTorch == 1.1.0
numpy
skimage
imageio
matplotlib
cv2

Train

Prepare training data

Download DIV2K training data (800 training + 100 validation images) from DIV2K dataset or SNU_CVLab.
Set '--dir_data' to the path containing the HR and LR images. In option.py, '--ext' is set to 'sep_reset', which first converts the .png files to .npy. Once all the training images (.png) have been converted to .npy files, set '--ext sep' to skip the conversion.

For more information, please refer to EDSR(PyTorch).

Begin to train

python main.py --model SMSR --save SMSR_X2 --scale 2 --patch_size 96 --batch_size 16
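To make the `--patch_size 96 --scale 2` setting concrete, here is a minimal sketch of aligned LR/HR patch cropping. It assumes, following the EDSR (PyTorch) convention, that `--patch_size` is the HR patch size, so each LR patch is 48x48 at scale 2. The function `get_paired_patch` is a hypothetical illustration, not the repository's data loader.

```python
import numpy as np

def get_paired_patch(lr, hr, patch_size=96, scale=2, rng=None):
    """Crop a random LR patch and the HR patch covering the same region.
    lr: (h, w, C) array, hr: (h*scale, w*scale, C) array."""
    rng = rng or np.random.default_rng()
    lr_size = patch_size // scale          # assumed: patch_size is the HR size
    h, w = lr.shape[:2]
    # top-left corner of the LR patch, chosen so the patch fits in the image
    y = int(rng.integers(0, h - lr_size + 1))
    x = int(rng.integers(0, w - lr_size + 1))
    lr_patch = lr[y:y + lr_size, x:x + lr_size]
    # the matching HR region starts at the scaled coordinates
    hy, hx = y * scale, x * scale
    hr_patch = hr[hy:hy + patch_size, hx:hx + patch_size]
    return lr_patch, hr_patch
```

Under the same assumption, `--batch_size 16` would stack 16 such pairs per training step.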

Test

Prepare test data

Download benchmark datasets (e.g., Set5, Set14 and other test sets) and prepare HR/LR images in testsets/benchmark following the example of testsets/benchmark/Set5.

Demo

python main.py --dir_data testsets --data_test Set5 --scale 2 --model SMSR --save SMSR_X2 --pre_train experiment/SMSR_X2/model/model_1000.pt --test_only --save_results

Results

Visualization of Sparse Masks

Citation

@InProceedings{Wang2020Exploring,
  author    = {Wang, Longguang and Dong, Xiaoyu and Wang, Yingqian and Ying, Xinyi and Lin, Zaiping and An, Wei and Guo, Yulan},
  title     = {Exploring Sparsity in Image Super-Resolution for Efficient Inference},
  booktitle = {CVPR},
  year      = {2021},
}

Acknowledgements

This code is built on EDSR (PyTorch). We thank the authors for sharing their code.


Owner

Longguang Wang
