Unifying Global-Local Representations in Salient Object Detection with Transformer

Last update: Aug 24, 2022

Overview

GLSTR (Global-Local Saliency Transformer)

This is the official implementation of the paper "Unifying Global-Local Representations in Salient Object Detection with Transformer" by Sucheng Ren, Qiang Wen, Nanxuan Zhao, Guoqiang Han, and Shengfeng He.

Prerequisites

The whole training process can be run on eight RTX 2080 Ti or four RTX 3090 GPUs.

PyTorch 1.6

Datasets

Training Set

We use the training set of DUTS (DUTS-TR) to train our model.

/path/to/DUTS-TR/
   img/
      img1.jpg
   label/
      label1.png
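
The layout above can be sanity-checked before training with a short script. This is only a sketch, not part of the official code: the function name `list_pairs` is hypothetical, and it assumes that images and labels correspond one-to-one when each directory is sorted by file name. The README does not say how the training code actually matches them.

```python
from pathlib import Path


def list_pairs(root):
    """Pair files in root/img with files in root/label by sorted order.

    Assumption: the n-th image (sorted by name) belongs to the n-th label.
    """
    root = Path(root)
    images = sorted(p for p in (root / "img").iterdir() if p.is_file())
    labels = sorted(p for p in (root / "label").iterdir() if p.is_file())
    # A count mismatch means at least one image or label is missing.
    if len(images) != len(labels):
        raise ValueError(
            f"{len(images)} images but {len(labels)} labels under {root}"
        )
    return list(zip(images, labels))
```

Calling `list_pairs("/path/to/DUTS-TR")` returns a list of `(image_path, label_path)` tuples, or raises `ValueError` if the two directories hold different numbers of files.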

Testing Set

We test our model on the testing sets of DUTS (DUTS-TE), ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD.

Training

Download the transformer backbone pretrained on ImageNet.

# Set the paths to the training data and the pretrained backbone in train.sh
bash train.sh

Testing

Download the pretrained model from Baidu pan (code: uo0a) or Google Drive, and put it in ./ckpt/

python test.py

Evaluation

The precomputed saliency maps (DUTS-TE, ECSSD, HKU-IS, PASCAL-S, DUT-OMRON, and SOD) can be found at Baidu pan (code: uo0a) or Google Drive.

After the paper was submitted, we retrained the model and its performance improved. Feel free to use either the results reported in our paper or the precomputed saliency maps.
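
One common way to compare a saliency map with its ground-truth mask is the mean absolute error (MAE). The README does not say which metrics or evaluation script the authors use, so the following is only an illustrative sketch. The function name `mean_absolute_error` is hypothetical, and it assumes both maps have already been loaded as equally sized 2D grids of pixel values in the range 0 to 255.

```python
def mean_absolute_error(pred, gt, max_value=255):
    """MAE between two equally sized 2D grids, after scaling values to [0, 1]."""
    if len(pred) != len(gt) or any(len(p) != len(g) for p, g in zip(pred, gt)):
        raise ValueError("prediction and ground truth must have the same shape")
    total = 0.0
    count = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            # Per-pixel absolute difference, normalized by the maximum value.
            total += abs(p - g) / max_value
            count += 1
    if count == 0:
        raise ValueError("empty input")
    return total / count
```

A lower value means the prediction is closer to the mask. Averaging this score over all images in a test set gives a single number per dataset.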

Contact

If you have any questions, feel free to email Sucheng Ren :) ([email protected])

Citation

Please cite our paper if you think the code and paper are helpful.

@article{ren2021unifying,
  title={Unifying Global-Local Representations in Salient Object Detection with Transformer},
  author={Ren, Sucheng and Wen, Qiang and Zhao, Nanxuan and Han, Guoqiang and He, Shengfeng},
  journal={arXiv preprint arXiv:2108.02759},
  year={2021}
}
