This is the implementation of our work Deep Extreme Cut (DEXTR), for object segmentation from extreme points.

Last update: Jan 05, 2023

Overview

Deep Extreme Cut (DEXTR)

Visit our project page to access the paper and the pre-computed results.

This code was ported to PyTorch 0.4.0! For the previous version of the code, which targets PyTorch 0.3.1, please check out this branch.

NEW: A Keras implementation with a TensorFlow backend is also available: DEXTR-KerasTensorflow!

Abstract

This paper explores the use of extreme points in an object (left-most, right-most, top, bottom pixels) as input to obtain precise object segmentation for images and videos. We do so by adding an extra channel to the image in the input of a convolutional neural network (CNN), which contains a Gaussian centered in each of the extreme points. The CNN learns to transform this information into a segmentation of an object that matches those extreme points. We demonstrate the usefulness of this approach for guided segmentation (grabcut-style), interactive segmentation, video object segmentation, and dense segmentation annotation. We show that we obtain the most precise results to date, also with less user input, in an extensive and varied selection of benchmarks and datasets.
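The input encoding described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration, not the repository's own helper code: the function names, the `sigma` value, and the choice to merge overlapping Gaussians with a maximum are all assumptions made for the example.

```python
import numpy as np


def extreme_points(mask):
    """Return the left-most, right-most, top and bottom pixels of a binary mask as (x, y)."""
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        raise ValueError("mask is empty")
    idx = [xs.argmin(), xs.argmax(), ys.argmin(), ys.argmax()]
    return [(int(xs[i]), int(ys[i])) for i in idx]


def gaussian_heatmap(shape, points, sigma=10.0):
    """Build an (H, W) map with a 2D Gaussian of peak value 1 centered on each point."""
    h, w = shape
    yy, xx = np.mgrid[0:h, 0:w]
    heatmap = np.zeros((h, w), dtype=np.float32)
    for x, y in points:
        g = np.exp(-((xx - x) ** 2 + (yy - y) ** 2) / (2.0 * sigma ** 2))
        # Assumption: overlapping Gaussians are merged with a per-pixel maximum.
        heatmap = np.maximum(heatmap, g.astype(np.float32))
    return heatmap


def add_extreme_point_channel(image, points, sigma=10.0):
    """Concatenate the heatmap to an (H, W, 3) image, giving an (H, W, 4) network input."""
    heatmap = gaussian_heatmap(image.shape[:2], points, sigma)
    return np.concatenate([image.astype(np.float32), heatmap[..., None]], axis=2)
```

At training time the extreme points can be derived from a ground-truth mask with `extreme_points`; at test time they would come from user clicks instead.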

Installation

The code was tested with Miniconda and Python 3.6. After installing the Miniconda environment:

Clone the repo:
```
git clone https://github.com/scaelles/DEXTR-PyTorch
cd DEXTR-PyTorch
```

Install dependencies:
```
conda install pytorch torchvision -c pytorch
conda install matplotlib opencv pillow scikit-learn scikit-image
```
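
Before running the demo, it can help to confirm that the dependencies are importable from the active environment. The following is a small sketch using only the standard library; the helper name `missing_packages` is hypothetical, and the mapping reflects the fact that several packages are imported under a different name than the one used to install them.

```python
import importlib.util

# Import names differ from the conda package names for some dependencies.
REQUIRED_MODULES = {
    "pytorch": "torch",
    "torchvision": "torchvision",
    "matplotlib": "matplotlib",
    "opencv": "cv2",
    "pillow": "PIL",
    "scikit-learn": "sklearn",
    "scikit-image": "skimage",
}


def missing_packages(required=REQUIRED_MODULES):
    """Return the package names whose import module cannot be found."""
    return [pkg for pkg, module in required.items()
            if importlib.util.find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("All dependencies found." if not missing else f"Missing: {', '.join(missing)}")
```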

Download the model by running the script inside models/:
```
cd models/
chmod +x download_dextr_model.sh
./download_dextr_model.sh
cd ..
```
The default model is trained on PASCAL VOC Segmentation train + SBD (10582 images). To download models trained on PASCAL VOC Segmentation train or COCO, please visit our project page, or see the Pre-trained models section at the end of this README.

To try the demo version of DEXTR, please run:
```
python demo.py
```

If installed correctly, the result should look like this:

To train and evaluate DEXTR on PASCAL (or PASCAL + SBD), please follow these additional steps:

Install tensorboard (integrated with PyTorch).
```
pip install tensorboard tensorboardx
```

Download the pre-trained PSPNet model for semantic segmentation, taken from this repository.

```
cd models/
chmod +x download_pretrained_psp_model.sh
./download_pretrained_psp_model.sh
cd ..
```

Set the paths in mypath.py so that they point to the locations of the PASCAL and SBD datasets.
Run python train_pascal.py after changing the default parameters, if necessary (e.g. gpu_id).
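
A wrong dataset path usually only surfaces once training starts, so a quick check beforehand can save time. The sketch below is independent of the repository's code: `check_dataset_paths` and the example directory names are hypothetical and should be replaced with the locations you set in mypath.py.

```python
import os


def check_dataset_paths(paths):
    """Return the dataset names whose directory does not exist."""
    return [name for name, root in paths.items() if not os.path.isdir(root)]


if __name__ == "__main__":
    # Hypothetical locations; replace with the paths you set in mypath.py.
    dataset_roots = {
        "pascal": "/path/to/PASCAL/VOC2012",
        "sbd": "/path/to/SBD",
    }
    for name in check_dataset_paths(dataset_roots):
        print(f"Dataset directory not found for '{name}': {dataset_roots[name]}")
```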

Enjoy!!

Pre-trained models

The following pre-trained DEXTR models are available under the MIT license:

PASCAL + SBD, trained on PASCAL VOC Segmentation train + SBD (10582 images). Achieves mIoU of 91.5% on PASCAL VOC Segmentation val.
PASCAL, trained on PASCAL VOC Segmentation train (1464 images). Achieves mIoU of 90.5% on PASCAL VOC Segmentation val.
COCO, trained on COCO train 2014 (82783 images). Achieves mIoU of 87.8% on PASCAL VOC Segmentation val.
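
For reference, the mIoU figures above are averages of intersection-over-union scores between predicted and ground-truth masks. The following is a minimal sketch of that metric for binary masks. It is not the repository's evaluation code, and details such as the binarization threshold, the treatment of void pixels, and the score assigned to two empty masks are assumptions here.

```python
import numpy as np


def iou(pred, gt):
    """Intersection over union of two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 1.0  # assumption: two empty masks count as a perfect match
    return float(np.logical_and(pred, gt).sum() / union)


def mean_iou(preds, gts):
    """Average the IoU over paired lists of predicted and ground-truth masks."""
    return float(np.mean([iou(p, g) for p, g in zip(preds, gts)]))
```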

Citation

If you use this code, please consider citing the following papers:

@Inproceedings{Man+18,
  Title          = {Deep Extreme Cut: From Extreme Points to Object Segmentation},
  Author         = {K.K. Maninis and S. Caelles and J. Pont-Tuset and L. {Van Gool}},
  Booktitle      = {Computer Vision and Pattern Recognition (CVPR)},
  Year           = {2018}
}

@InProceedings{Pap+17,
  Title          = {Extreme clicking for efficient object annotation},
  Author         = {D.P. Papadopoulos and J. Uijlings and F. Keller and V. Ferrari},
  Booktitle      = {ICCV},
  Year           = {2017}
}

We thank the authors of pytorch-deeplab-resnet for making their PyTorch re-implementation of DeepLab-v2 available!

If you encounter any problems, please contact us at {kmaninis, scaelles}@vision.ee.ethz.ch.

Owner

Sergi Caelles
