This is the implementation of our work Deep Extreme Cut (DEXTR), for object segmentation from extreme points.

Last update: Jan 05, 2023

Overview

Deep Extreme Cut (DEXTR)

Visit our project page to access the paper and the pre-computed results.

This code was ported to PyTorch 0.4.0! For the previous version of the code, which targets PyTorch 0.3.1, please check out this branch.

NEW: An implementation in Keras with the TensorFlow backend is also available: DEXTR-KerasTensorflow!

Abstract

This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise object segmentation for images and videos. We do so by adding an extra channel to the image in the input of a convolutional neural network (CNN), which contains a Gaussian centered in each of the extreme points. The CNN learns to transform this information into a segmentation of an object that matches those extreme points. We demonstrate the usefulness of this approach for guided segmentation (grabcut-style), interactive segmentation, video object segmentation, and dense segmentation annotation. We show that we obtain the most precise results to date, also with less user input, in an extensive and varied selection of benchmarks and datasets.
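
The extra input channel described above can be sketched in a few lines of NumPy. This is only an illustration of the idea, not the code used in this repository: the function names, the `sigma` value, the choice of taking the per-pixel maximum over the four Gaussians, and the way extreme points are derived from a binary mask are all assumptions made for the example.

```python
import numpy as np


def extreme_points(mask):
    """Return left-most, right-most, top and bottom foreground pixels as (x, y).

    Hypothetical helper: it simulates user clicks from a binary mask.
    """
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        raise ValueError("mask contains no foreground pixels")
    picks = [xs.argmin(), xs.argmax(), ys.argmin(), ys.argmax()]
    return [(int(xs[i]), int(ys[i])) for i in picks]


def gaussian_heatmap(shape, points, sigma=10.0):
    """Place one 2D Gaussian on each point and keep the per-pixel maximum."""
    height, width = shape
    yy, xx = np.mgrid[0:height, 0:width]
    heatmap = np.zeros((height, width), dtype=np.float64)
    for x, y in points:
        gaussian = np.exp(-((xx - x) ** 2 + (yy - y) ** 2) / (2.0 * sigma ** 2))
        heatmap = np.maximum(heatmap, gaussian)
    return heatmap.astype(np.float32)


def add_extreme_point_channel(image, points, sigma=10.0):
    """Stack the heatmap onto an H x W x 3 image, giving an H x W x 4 array."""
    heatmap = gaussian_heatmap(image.shape[:2], points, sigma)
    return np.dstack([image.astype(np.float32), heatmap])
```

A network that consumes this input would take four channels instead of three. The heatmap is left in the range [0, 1] here; any rescaling to match the image range is a separate design choice.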

Installation

The code was tested with Miniconda and Python 3.6. After installing the Miniconda environment:

Clone the repo:

```
git clone https://github.com/scaelles/DEXTR-PyTorch
cd DEXTR-PyTorch
```

Install dependencies:

```
conda install pytorch torchvision -c pytorch
conda install matplotlib opencv pillow scikit-learn scikit-image
```
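
To confirm that the dependencies are visible to the active Python environment, a small check along the following lines may help. It is not part of the repository; the function name and the module list are assumptions for this sketch. Note that several packages are imported under a different name than the one used to install them (`opencv` as `cv2`, `pillow` as `PIL`, `scikit-learn` as `sklearn`, `scikit-image` as `skimage`, `pytorch` as `torch`).

```python
import importlib.util

# Import names of the packages installed with conda in the step above.
REQUIRED_MODULES = [
    "torch", "torchvision", "matplotlib", "cv2", "PIL", "sklearn", "skimage",
]


def missing_modules(names):
    """Return the module names that cannot be located in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

The check only locates the modules without importing them, so it does not verify versions or GPU support.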

Download the model by running the script inside models/:
```
cd models/
chmod +x download_dextr_model.sh
./download_dextr_model.sh
cd ..
```
The default model is trained on PASCAL VOC Segmentation train + SBD (10582 images). To download models trained on PASCAL VOC Segmentation train or COCO, please visit our project page, or see the Pre-trained models section at the end of this README.

To try the demo version of DEXTR, please run:
```
python demo.py
```

If installed correctly, the result should look like this:

To train and evaluate DEXTR on PASCAL (or PASCAL + SBD), please follow these additional steps:

Install tensorboard (integrated with PyTorch).
```
pip install tensorboard tensorboardx
```

Download the pre-trained PSPNet model for semantic segmentation, taken from this repository.

```
cd models/
chmod +x download_pretrained_psp_model.sh
./download_pretrained_psp_model.sh
cd ..
```

Set the paths in mypath.py so that they point to the location of the PASCAL/SBD datasets.
Run python train_pascal.py after changing the default parameters if necessary (e.g. gpu_id).

Enjoy!!

Pre-trained models

The following pre-trained DEXTR models are available under the MIT license:

PASCAL + SBD, trained on PASCAL VOC Segmentation train + SBD (10582 images). Achieves mIoU of 91.5% on PASCAL VOC Segmentation val.
PASCAL, trained on PASCAL VOC Segmentation train (1464 images). Achieves mIoU of 90.5% on PASCAL VOC Segmentation val.
COCO, trained on COCO train 2014 (82783 images). Achieves mIoU of 87.8% on PASCAL VOC Segmentation val.
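
For readers unfamiliar with the metric, the sketch below shows the generic definition of intersection over union (IoU) for binary masks and its mean over a set of mask pairs. It is not the evaluation code behind the numbers above: the function names, the handling of two empty masks, and the plain averaging are assumptions, and details of the actual protocol (such as thresholds or void pixels) are not covered.

```python
import numpy as np


def iou(pred, gt):
    """Intersection over union of two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Both masks are empty; treated as perfect agreement (an assumption).
        return 1.0
    return float(np.logical_and(pred, gt).sum() / union)


def mean_iou(pairs):
    """Average IoU over an iterable of (prediction, ground truth) pairs."""
    scores = [iou(pred, gt) for pred, gt in pairs]
    if not scores:
        raise ValueError("no mask pairs given")
    return sum(scores) / len(scores)
```

For example, a prediction covering two pixels of which one overlaps a one-pixel ground truth has an intersection of 1 and a union of 2, so its IoU is 0.5.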

Citation

If you use this code, please consider citing the following papers:

@Inproceedings{Man+18,
  Title          = {Deep Extreme Cut: From Extreme Points to Object Segmentation},
  Author         = {K.K. Maninis and S. Caelles and J. Pont-Tuset and L. {Van Gool}},
  Booktitle      = {Computer Vision and Pattern Recognition (CVPR)},
  Year           = {2018}
}

@InProceedings{Pap+17,
  Title          = {Extreme clicking for efficient object annotation},
  Author         = {D.P. Papadopoulos and J. Uijlings and F. Keller and V. Ferrari},
  Booktitle      = {ICCV},
  Year           = {2017}
}

We thank the authors of pytorch-deeplab-resnet for making their PyTorch re-implementation of DeepLab-v2 available!

If you encounter any problems please contact us at {kmaninis, scaelles}@vision.ee.ethz.ch.

Owner

Sergi Caelles
