The PyTorch implementation of DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision.

Last update: Oct 23, 2021

Overview

DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

@article{lan2021discobox,
  title={DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision},
  author={Lan, Shiyi and Yu, Zhiding and Choy, Christopher and Radhakrishnan, Subhashree and Liu, Guilin and Zhu, Yuke and Davis, Larry S and Anandkumar, Anima},
  journal={arXiv preprint arXiv:2105.06464},
  year={2021}
}

[ Paper ]

Introduction

This repository contains the code for evaluating DiscoBox on the PF-PASCAL dataset.

The implementation is based on SCOT.

Installation

Conda environment settings

conda create -n scot python=3.6
conda activate scot

# Check the installed CUDA version, then run the matching install command below.
cat /usr/local/cuda/version.txt
conda install pytorch=1.4.0 torchvision cudatoolkit=10.0 -c pytorch  # if CUDA 10
conda install pytorch=1.4.0 torchvision cudatoolkit=9.0 -c pytorch   # if CUDA 9

conda install -c anaconda scikit-image
conda install -c anaconda pandas
conda install -c anaconda requests
pip install gluoncv-torch
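
After installation, a quick check that the packages can be found in the active environment may save a failed evaluation run. The following is a minimal sketch, not part of the repository. The import names are assumptions: scikit-image is imported as `skimage`, and gluoncv-torch is assumed to be importable as `gluoncvth`. The helper `find_missing` is a hypothetical name introduced for illustration.

```python
import importlib.util

# Import names are assumptions: scikit-image installs as "skimage",
# and gluoncv-torch is assumed to install as "gluoncvth".
REQUIRED_MODULES = [
    "torch",
    "torchvision",
    "skimage",
    "pandas",
    "requests",
    "gluoncvth",
]


def find_missing(modules):
    """Return the top-level module names that cannot be located."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

The check only locates modules and does not import them, so it does not confirm that the installed PyTorch build matches the local CUDA version.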

Pretrained Weights

Download the pretrained weights from this link.

Evaluation

To reproduce the results on PF-PASCAL with the ResNet-101 backbone, run the evaluation once per `--alpha` threshold:

    python evaluate_map_CAM.py --dataset pfpascal --thres img --backbone resnet101 --hyperpixel '(2,22,24,25,27,28,29)' --sim OTGeo --exp1 1.0 --exp2 0.5 --eps 0.05 --gpu 0 --classmap 1 --split test --alpha 0.05
    python evaluate_map_CAM.py --dataset pfpascal --thres img --backbone resnet101 --hyperpixel '(2,22,24,25,27,28,29)' --sim OTGeo --exp1 1.0 --exp2 0.5 --eps 0.05 --gpu 0 --classmap 1 --split test --alpha 0.10
    python evaluate_map_CAM.py --dataset pfpascal --thres img --backbone resnet101 --hyperpixel '(2,22,24,25,27,28,29)' --sim OTGeo --exp1 1.0 --exp2 0.5 --eps 0.05 --gpu 0 --classmap 1 --split test --alpha 0.15
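
The three commands above differ only in `--alpha`, so they can be generated in a loop. The sketch below is a hypothetical convenience wrapper, not part of the repository. It uses only the flags shown above, and the names `build_command` and `run_all` are introduced here for illustration. By default it only prints the commands, and it executes them only when `execute=True` is passed.

```python
import shlex
import subprocess
import sys

# Thresholds taken from the three commands in this README.
ALPHAS = [0.05, 0.10, 0.15]


def build_command(alpha):
    """Build the argument list for one evaluation run at the given alpha."""
    return [
        sys.executable, "evaluate_map_CAM.py",
        "--dataset", "pfpascal",
        "--thres", "img",
        "--backbone", "resnet101",
        "--hyperpixel", "(2,22,24,25,27,28,29)",
        "--sim", "OTGeo",
        "--exp1", "1.0",
        "--exp2", "0.5",
        "--eps", "0.05",
        "--gpu", "0",
        "--classmap", "1",
        "--split", "test",
        "--alpha", "{:.2f}".format(alpha),
    ]


def run_all(alphas=ALPHAS, execute=False):
    """Print each command; run it as well when execute is True."""
    for alpha in alphas:
        cmd = build_command(alpha)
        print(" ".join(shlex.quote(part) for part in cmd))
        if execute:
            # check=True stops the loop if one evaluation fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(execute=False)
```

Passing the arguments as a list avoids shell quoting problems with the parenthesized `--hyperpixel` value. Call `run_all(execute=True)` from the repository root to launch the three runs in sequence.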

All results

Method	Backbone	PCK@0.05	PCK@0.10	PCK@0.15
SCOT	ResNet-101	63.2	85.4	92.8
DiscoBox	ResNet-101	62.7	85.6	93.5
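
The three result columns correspond to the three `--alpha` values used in the evaluation commands. As an illustration of what such a threshold means, here is a minimal sketch of the percentage of correct keypoints (PCK) metric commonly reported on PF-PASCAL. It assumes that `--thres img` scales the threshold by the larger side of the image, and that a keypoint counts as correct when its Euclidean error does not exceed that threshold. This is an assumption about the convention, not a copy of the repository's evaluation code, and the function name `pck` is hypothetical.

```python
import math


def pck(predicted, ground_truth, image_size, alpha):
    """Fraction of keypoints whose error is within alpha * max(height, width).

    predicted, ground_truth: lists of (x, y) keypoints in the same order.
    image_size: (height, width) of the image the keypoints live in.
    """
    if len(predicted) != len(ground_truth):
        raise ValueError("predicted and ground_truth must have equal length")
    if not ground_truth:
        raise ValueError("at least one keypoint is required")

    height, width = image_size
    # Assumed convention for an image-based threshold.
    threshold = alpha * max(height, width)

    correct = sum(
        1
        for (px, py), (gx, gy) in zip(predicted, ground_truth)
        if math.hypot(px - gx, py - gy) <= threshold
    )
    return correct / len(ground_truth)


if __name__ == "__main__":
    # Made-up keypoints for illustration only.
    pred = [(10, 10), (100, 100), (250, 150)]
    gt = [(12, 14), (100, 140), (250, 150)]
    for alpha in (0.05, 0.10, 0.15):
        print(alpha, pck(pred, gt, image_size=(200, 300), alpha=alpha))
```

A larger `alpha` tolerates larger localization errors, which is why the scores in the table increase from left to right. A dataset-level score would typically average this per-pair value over all test pairs.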

Training

See this link for training details.

Owner

Shiyi Lan
