Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)

Last update: Dec 14, 2022

Related tags

Deep Learning SNL_ICCV2021

Overview

Spectral Nonlocal Block

Overview

Official implementation of the paper: Unifying Nonlocal Blocks for Neural Networks (ICCV'21)

Spectral View of Nonlocal Block

Our work provide a novel perspective for the model design of non-local blocks called the Spectral View of Non-local. In this view, the non-local block can be seen as operating a set of graph filters on a fully connected weighted graph. Our spectral view can help to therorotivally anaylize exsiting non-local blocks and design novel non-local block with the help of graph signal processing (e.g. the graph neural networks).

Spectral Nonlocal Block

This repository gives the implementation of Spectral Nonlocal Block (SNL) that is theoreotically designed with the help of first-order chebyshev graph convolution. The structure of the SNL is given below:

Two main differences between SNL and exisiting nonlocals, which make SNL can concern the graph spectral:

The SNL using a symmetrical affinity matrix to ensure that the graph laplacian of the fully connected weighted graph is diagonalizable.
The SNL using the normalized laplacian to conform the upper bound of maximum eigenvalue (equal to 2) for arbitrary graph structure.

More novel nonlocal blocks defined with other type graph filters will release soon, for example Cheby Filter, Amma Filter, and the Cayley Filter.

Getting Starte

Requirements

PyTorch >= 0.4.1

Python >= 3.5

torchvision >= 0.2.1

termcolor >= 1.1.0

tensorboardX >= 1.9

opencv >= 3.4

Classification

To train the SNL:

install the conda environment using "env.yml"
Setting --data_dir as the root directory of the dataset in "train_snl.sh"
Setting --dataset as the train/val dataset (cifar10/cifar100/imagenet)
Setting --backbone as the backbone type (we suggest using preresnet for CIFAR and resnet for ImageNet)
Setting --arch as the backbone deepth (we suggest using 20/56 for preresnet and 50 for resnet)
Other parameter such as learning rate, batch size can be found/set in "train_val.py"
run the code by: "sh train_snl.sh"
the training log and checkpoint are saving in "save_model"

Semantic Segmentation

We also give the module/config implementated for semantic segmentation based on mmsegmentation framework, one can regist our SNL block and train our SNL for semantic segmentation (Cityscape) followed their step.

Citation

@InProceedings{Lei_2021_ICCV,
title = {Unifying Nonlocal Blocks for Neural Networks},
author = {Zhu, Lei and She, Qi and Li, Duo and Lu, Yanye and Kang, Xuejing and Hu, Jie and Wang, Changhu},
booktitle = {IEEE International Conference on Computer Vision (ICCV)},
month = {October},
year = {2021}
}

Acknowledgement

This code and our experiments are conducted based on the release code of CGNL / mmsegmentation framework / 3D-ResNet framework. Here we thank for their remarkable works.

Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)

Related tags

Overview

Spectral Nonlocal Block

Overview

Spectral View of Nonlocal Block

Spectral Nonlocal Block

Getting Starte

Requirements

Classification

Semantic Segmentation

Citation

Acknowledgement

Owner

Yolact-keras实例分割模型在keras当中的实现

PyTorch framework, for reproducing experiments from the paper Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks

Hyperopt for solving CIFAR-100 with a convolutional neural network (CNN) built with Keras and TensorFlow, GPU backend

Paper Code：A Self-adaptive Weighted Differential Evolution Approach for Large-scale Feature Selection

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

Official Implementation for the paper DeepFace-EMD: Re-ranking Using Patch-wise Earth Mover’s Distance Improves Out-Of-Distribution Face Identification

CenterNet:Objects as Points目标检测模型在Pytorch当中的实现

Real-time Neural Representation Fusion for Robust Volumetric Mapping

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

Official code for paper "Demystifying Local Vision Transformer: Sparse Connectivity, Weight Sharing, and Dynamic Weight"

Drone detection using YOLOv5

Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

DirectVoxGO reconstructs a scene representation from a set of calibrated images capturing the scene.

😮The official implementation of "CoNeRF: Controllable Neural Radiance Fields" 😮

iNAS: Integral NAS for Device-Aware Salient Object Detection

Active learning for Mask R-CNN in Detectron2

Seach Losses of our paper 'Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search', accepted by ICLR 2021.

Source code and data from the RecSys 2020 article "Carousel Personalization in Music Streaming Apps with Contextual Bandits" by W. Bendada, G. Salha and T. Bontempelli

null