code for CVPR paper Zero-shot Instance Segmentation

Last update: Dec 13, 2022

Overview

Code for CVPR2021 paper

Zero-shot Instance Segmentation

Code requirements

python: python3.7
nvidia GPU
pytorch1.1.0
GCC >=5.4
NCCL 2
the other python libs in requirement.txt

Install

conda create -n zsi python=3.7 -y
conda activate zsi

conda install pytorch=1.1.0 torchvision=0.3.0 cudatoolkit=10.0 -c pytorch

pip install cython && pip --no-cache-dir install -r requirements.txt
   
python setup.py develop

Dataset prepare

Download the train and test annotations files for zsi from annotations, put all json label file to
```
data/coco/annotations/
```
Download MSCOCO-2014 dataset and unzip the images it to path：
```
data/coco/train2014/
data/coco/val2014/
```

Training:

48/17 split:

   chmod +x tools/dist_train.sh
   ./tools/dist_train.sh configs/zsi/train/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder.py 4

65/15 split:

chmod +x tools/dist_train.sh
./tools/dist_train.sh configs/zsi/train/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh.py 4

Inference & Evaluate:

ZSI task:

48/17 split ZSI task:

download 48/17 ZSI model, put it in checkpoints/ZSI_48_17.pth

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/48_17/test/zsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder.py checkpoints/ZSI_48_17.pth 4 --json_out results/zsi_48_17.json

our results zsi_48_17.bbox.json and zsi_48_17.segm.json can also downloaded from zsi_48_17_reults.

evaluate:

for zsd performance

python tools/zsi_coco_eval.py results/zsi_48_17.bbox.json --ann data/coco/annotations/instances_val2014_unseen_48_17.json

for zsi performance

python tools/zsi_coco_eval.py results/zsi_48_17.segm.json --ann data/coco/annotations/instances_val2014_unseen_48_17.json --types segm

65/15 split ZSI task:

download 65/15 ZSI model, put it in checkpoints/ZSI_65_15.pth

inference:

chmod +x tools/dist_test.sh
./toools/dist_test.sh configs/zsi/65_15/test/zsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh.py checkpoints/ZSI_65_15.pth 4 --json_out results/zsi_65_15.json

our results zsi_65_15.bbox.json and zsi_65_15.segm.json can also downloaded from zsi_65_15_reults.

evaluate:

for zsd performance

python tools/zsi_coco_eval.py results/zsi_65_15.bbox.json --ann data/coco/annotations/instances_val2014_unseen_65_15.json

for zsi performance

python tools/zsi_coco_eval.py results/zsi_65_15.segm.json --ann data/coco/annotations/instances_val2014_unseen_65_15.json --types segm

GZSI task:

48/17 split GZSI task:

use the same model file ZSI_48_17.pth in ZSI task

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/48_17/test/gzsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_decoder_gzsi.py checkpoints/ZSI_48_17.pth 4 --json_out results/gzsi_48_17.json

our results gzsi_48_17.bbox.json and gzsi_48_17.segm.json can also downloaded from gzsi_48_17_results.

evaluate:

for gzsd

python tools/gzsi_coco_eval.py results/gzsi_48_17.bbox.json --ann data/coco/annotations/instances_val2014_gzsi_48_17.json --gzsi --num-seen-classes 48

for gzsi

python tools/gzsi_coco_eval.py results/gzsi_48_17.segm.json --ann data/coco/annotations/instances_val2014_gzsi_48_17.json --gzsi --num-seen-classes 48 --types segm

65/15 split GZSI task:

use the same model file ZSI_48_17.pth in ZSI task

inference:

chmod +x tools/dist_test.sh
./tools/dist_test.sh configs/zsi/65_15/test/gzsi/zero-shot-mask-rcnn-BARPN-bbox_mask_sync_bg_65_15_decoder_notanh_gzsi.py checkpoints/ZSI_65_15.pth 4 --json_out results/gzsi_65_15.json

our results gzsi_65_15.bbox.json and gzsi_65_15.segm.json can also downloaded from gzsi_65_15_results.

evaluate:

for gzsd

python tools/gzsi_coco_eval.py results/gzsi_65_15.bbox.json --ann data/coco/annotations/instances_val2014_gzsi_65_15.json --gzsd --num-seen-classes 65

for gzsi

python tools/gzsi_coco_eval.py results/gzsi_65_15.segm.json --ann data/coco/annotations/instances_val2014_gzsi_65_15.json --gzsd --num-seen-classes 65 --types segm

License

ZSI is released under MIT License.

Citing

If you use ZSI in your research or wish to refer to the baseline results published here, please use the following BibTeX entries:

@InProceedings{zhengye2021zsi,
  author  =  {Ye, Zheng and Jiahong, Wu and Yongqiag, Qin and Faen, Zhang and Li, Cui},
  title   =  {Zero-shot Instance Segmentation},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2021}
}

code for CVPR paper Zero-shot Instance Segmentation

Related tags

Overview

Code for CVPR2021 paper

Zero-shot Instance Segmentation

Code requirements

Install

Dataset prepare

License

Citing

Owner

zhengye

An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.

Dataset and codebase for NeurIPS 2021 paper: Exploring Forensic Dental Identification with Deep Learning

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022

A collection of scripts I developed for personal and working projects.

Open source person re-identification library in python

基于DouZero定制AI实战欢乐斗地主

Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

A Streamlit component to render ECharts.

Codes for paper "Towards Diverse Paragraph Captioning for Untrimmed Videos". CVPR 2021

Revisiting Global Statistics Aggregation for Improving Image Restoration

Pretty Tensor - Fluent Neural Networks in TensorFlow

PyTorch implementations of Generative Adversarial Networks.

Rendering color and depth images for ShapeNet models.

Final Project for the CS238: Decision Making Under Uncertainty course at Stanford University in Autumn '21.

The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

A naive ROS interface for visualDet3D.

blind SQLIpy sebuah alat injeksi sql yang menggunakan waktu sql untuk mendapatkan sebuah server database.

A python software that can help blind people find things like laptops, phones, etc the same way a guide dog guides a blind person in finding his way.