Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

Last update: Dec 01, 2022

Overview

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [BCNet, CVPR 2021]

This is the official PyTorch implementation of BCNet, built on the open-source detectron2 library.

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
Lei Ke, Yu-Wing Tai, Chi-Keung Tang
CVPR 2021

Two-stage instance segmentation with state-of-the-art performance.
Image formation as composition of two overlapping layers.
Bilayer decoupling for the occluder and occludee.
Efficacy on both the FCOS and Faster R-CNN detectors.

Under construction. Our code and pretrained model will be fully released in two months.

Visualization of Occluded Objects

Qualitative instance segmentation results of BCNet with a ResNet-101-FPN backbone and the Faster R-CNN detector. The bottom row shows square heatmaps of the contour and mask predictions made by the two GCN layers for the occluder and the occludee within the same ROI, marked by the red bounding box. These intermediate predictions also make the final segmentation result of BCNet more explainable than that of previous methods.

Qualitative instance segmentation results of BCNet with a ResNet-101-FPN backbone and the FCOS detector.

Results on COCO test-dev

(See Table 8 of the paper for full results. All methods are trained on COCO train2017.)

Detector	Backbone	Method	mAP(mask)
Faster R-CNN	ResNet-50 FPN	Mask R-CNN	34.2
Faster R-CNN	ResNet-50 FPN	MS R-CNN	35.6
Faster R-CNN	ResNet-50 FPN	PointRend	36.3
Faster R-CNN	ResNet-50 FPN	PANet	36.6
Faster R-CNN	ResNet-50 FPN	BCNet	38.4
Faster R-CNN	ResNet-101 FPN	Mask R-CNN	36.1
Faster R-CNN	ResNet-101 FPN	BMask R-CNN	37.7
Faster R-CNN	ResNet-101 FPN	MS R-CNN	38.3
Faster R-CNN	ResNet-101 FPN	BCNet	39.8, [Pretrained Model]
FCOS	ResNet-101 FPN	SipMask	37.8
FCOS	ResNet-101 FPN	BlendMask	38.4
FCOS	ResNet-101 FPN	CenterMask	38.3
FCOS	ResNet-101 FPN	BCNet	39.6, [Pretrained Model]

Introduction

Segmenting highly overlapping objects is challenging because typically no distinction is made between real object contours and occlusion boundaries. Unlike previous two-stage instance segmentation methods, BCNet models image formation as a composition of two overlapping layers: the top GCN layer detects the occluding object (occluder) and the bottom GCN layer infers the partially occluded instance (occludee). Explicitly modeling the occlusion relationship with this bilayer structure naturally decouples the boundaries of the occluding and occluded instances, and takes the interaction between them into account during mask regression. We validate the efficacy of bilayer decoupling on both one-stage and two-stage object detectors with different backbones and network layer choices.

[Figure: network architecture of BCNet]
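
The layered view of image formation described in this section can also be illustrated, independently of the network, with a toy example. The sketch below is not BCNet code: it only shows how a full occludee mask and an occluder mask in the same ROI compose into what is actually visible. The function names and the tiny hand-written masks are hypothetical and chosen for illustration; in BCNet the masks are predicted by the two GCN layers.

```python
from typing import List

Mask = List[List[int]]  # 2D binary mask: 1 = object pixel, 0 = background


def visible_occludee(occluder: Mask, occludee_full: Mask) -> Mask:
    """Occludee pixels that stay visible once the occluder is layered on top."""
    return [
        [int(bottom == 1 and top == 0) for top, bottom in zip(top_row, bottom_row)]
        for top_row, bottom_row in zip(occluder, occludee_full)
    ]


def compose_layers(occluder: Mask, occludee_full: Mask) -> Mask:
    """Label map of the composed ROI: 2 = occluder, 1 = visible occludee, 0 = background."""
    return [
        [2 if top else (1 if bottom else 0) for top, bottom in zip(top_row, bottom_row)]
        for top_row, bottom_row in zip(occluder, occludee_full)
    ]


# Hypothetical 3x4 ROI: the occluder covers the top-left corner of the occludee.
occluder = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]
occludee_full = [
    [0, 1, 1, 1],
    [0, 1, 1, 1],
    [0, 1, 1, 1],
]

visible = visible_occludee(occluder, occludee_full)
composed = compose_layers(occluder, occludee_full)
```

In this toy ROI the boundary between labels 2 and 1 is an occlusion boundary rather than a real contour of the occludee, which is the ambiguity the bilayer structure is meant to resolve.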

Step-by-step Installation

conda create -n bcnet python=3.7 -y
conda activate bcnet
 
conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch
 
# FCOS and coco api and visualization dependencies
pip install ninja yacs cython matplotlib tqdm
pip install opencv-python==4.4.0.40
 
export INSTALL_DIR=$PWD
 
# install pycocotools. Please make sure you have installed cython.
cd $INSTALL_DIR
git clone https://github.com/cocodataset/cocoapi.git
cd cocoapi/PythonAPI
python setup.py build_ext install
 
# install BCNet
cd $INSTALL_DIR
git clone https://github.com/lkeab/BCNet.git
cd BCNet/
python3 setup.py build develop
 
unset INSTALL_DIR
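
After the installation steps above, a quick sanity check can confirm that the main packages are importable from the `bcnet` environment. This is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the list of import names (in particular `detectron2` as the name installed by `setup.py`, and `cv2` for `opencv-python`) is an assumption.

```python
import importlib.util
from typing import Iterable, List

# Import names assumed to correspond to the packages installed above.
DEFAULT_MODULES = ("torch", "torchvision", "cv2", "pycocotools", "detectron2")


def missing_modules(names: Iterable[str] = DEFAULT_MODULES) -> List[str]:
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All expected modules were found.")
```

The check only looks the modules up without importing them, so it does not verify that the compiled extensions or the CUDA toolkit actually work.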

Dataset Preparation

Prepare the COCO 2017 dataset following this instruction, then replace the original annotation file with our converted mask annotations for bilayer decoupling training.

  mkdir -p datasets/coco
  ln -s /path_to_coco_dataset/annotations datasets/coco/annotations
  ln -s /path_to_coco_dataset/train2017 datasets/coco/train2017
  ln -s /path_to_coco_dataset/test2017 datasets/coco/test2017
  ln -s /path_to_coco_dataset/val2017 datasets/coco/val2017
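
Before training, it can help to verify that the symlinks above resolve to real directories. The following is a small sketch under the assumption that the layout is exactly the one created by the commands above; the helper name `missing_coco_dirs` is hypothetical and not part of the repository.

```python
import os
from typing import List

# Sub-directories created by the symlink commands above.
EXPECTED_SUBDIRS = ("annotations", "train2017", "val2017", "test2017")


def missing_coco_dirs(root: str = "datasets/coco") -> List[str]:
    """Return the expected sub-directories that are absent or are broken symlinks."""
    return [
        name
        for name in EXPECTED_SUBDIRS
        if not os.path.isdir(os.path.join(root, name))  # isdir follows symlinks
    ]


if __name__ == "__main__":
    missing = missing_coco_dirs()
    if missing:
        print("Missing under datasets/coco:", ", ".join(missing))
    else:
        print("Dataset layout looks complete.")
```

Run it from the BCNet repository root so that the relative path `datasets/coco` resolves correctly.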

Multi-GPU Training and evaluation on Validation set

bash all.sh

CUDA_VISIBLE_DEVICES=0,1 python3 tools/train_net.py --num-gpus 2 \
	--config-file configs/fcos/fcos_imprv_R_50_FPN_1x.yaml 2>&1 | tee log/train_log.txt
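
When changing the number of GPUs, `CUDA_VISIBLE_DEVICES` and `--num-gpus` in the command above need to stay consistent. The sketch below shows one way to derive both from a single list of GPU ids. It is an illustration only: `build_train_command` is a hypothetical helper, and it uses just the script path and the two flags that appear in the command above.

```python
import os
from typing import Dict, List, Sequence, Tuple


def build_train_command(
    gpu_ids: Sequence[int], config_file: str
) -> Tuple[Dict[str, str], List[str]]:
    """Build the environment override and argv for the training command above."""
    if not gpu_ids:
        raise ValueError("at least one GPU id is required")
    env = {"CUDA_VISIBLE_DEVICES": ",".join(str(i) for i in gpu_ids)}
    argv = [
        "python3",
        "tools/train_net.py",
        "--num-gpus",
        str(len(gpu_ids)),  # always matches the number of visible devices
        "--config-file",
        config_file,
    ]
    return env, argv


env, argv = build_train_command([0, 1], "configs/fcos/fcos_imprv_R_50_FPN_1x.yaml")

# To launch from the repository root (not executed here):
#   import subprocess
#   subprocess.run(argv, env={**os.environ, **env}, check=True)
```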

Pretrained Models

TBD

  mkdir pretrained_models
  # Put the downloaded pretrained models in this directory.

Testing on Test-dev

TBD

bash eval.sh

Citations

If you find BCNet useful in your research, please star this repository and consider citing:

@inproceedings{ke2021bcnet,
    author = {Ke, Lei and Tai, Yu-Wing and Tang, Chi-Keung},
    title = {Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers},
    booktitle = {CVPR},
    year = {2021},
}

License

BCNet is released under the MIT license. See LICENSE for additional details. Thanks to the third-party library detectron2.
