Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Last update: Dec 07, 2022

Related tags

Overview

Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth

This codebase implements the loss function described in:

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth Davy Neven, Bert De Brabandere, Marc Proesmans, and Luc Van Gool Conference on Computer Vision and Pattern Recognition (CVPR), june 2019

Our network architecture is a multi-branched version of ERFNet and uses the Lovasz-hinge loss for maximizing the IoU of each instance.

License

This software is released under a creative commons license which allows for personal and research use only. For a commercial license please contact the authors. You can view a license summary here.

Getting started

This codebase showcases the proposed loss function on car instance segmentation using the Cityscapes dataset.

Prerequisites

Dependencies:

Pytorch 1.1
Python 3.6.8 (or higher)
Cityscapes + scripts (if you want to evaluate the model)

Training

Training consists out of 2 steps. We first train on 512x512 crops around each object, to avoid computation on background patches. Afterwards, we finetune on larger patches (1024x1024) to account for bigger objects and background features which are not present in the smaller crops.

To generate these crops do the following:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python utils/generate_crops.py

Afterwards start training:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python train.py

Different options can be modified in train_config.py, e.g. to visualize set display=True.

Testing

You can download a pretrained model here. Save this file in the src/pretrained_models/ or adapt the test_config.py file.

To test the model on the Cityscapes validation set run:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python test.py

The pretrained model gets 56.4 AP on the car validation set.

Acknowledgement

This work was supported by Toyota, and was carried out at the TRACE Lab at KU Leuven (Toyota Research on Automated Cars in Europe - Leuven)

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Related tags

Overview

Instance segmentation by jointly optimizing spatial embeddings and clustering bandwidth

License

Getting started

Prerequisites

Training

Testing

Acknowledgement

Owner

Official repo for SemanticGAN https://nv-tlabs.github.io/semanticGAN/

Real-time analysis of intracranial neurophysiology recordings.

Out-of-Distribution Generalization of Chest X-ray Using Risk Extrapolation

Jittor Medical Segmentation Lib -- The assignment of Pattern Recognition course (2021 Spring) in Tsinghua University

ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset

A machine learning package for streaming data in Python. The other ancestor of River.

Awesome Weak-Shot Learning

An Image compression simulator that uses Source Extractor and Monte Carlo methods to examine the post compressive effects different compression algorithms have.

Least Square Calibration for Peer Reviews

Speech Recognition is an important feature in several applications used such as home automation, artificial intelligence

Compute execution plan: A DAG representation of work that you want to get done. Individual nodes of the DAG could be simple python or shell tasks or complex deeply nested parallel branches or embedded DAGs themselves.

Time-Optimal Planning for Quadrotor Waypoint Flight

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Brain Tumor Detection with Tensorflow Neural Networks.

The `rtdl` library + The official implementation of the paper

Easy to use Python camera interface for NVIDIA Jetson

Subgraph Based Learning of Contextual Embedding

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"