Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

Last update: Nov 17, 2022

Related tags

Deep Learning Dsig

Overview

DSIG

Deep Structured Instance Graph for Distilling Object Detectors

Authors: Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, Jiaya Jia.

[pdf] [slide] [supp] [bibtex]

This repo provides the implementation of paper "Deep Structured Instance Graph for Distilling Object Detectors"(Dsig) based on detectron2. Specifically, aiming at solving the feature imbalance problem while further excavating the missing relation inside semantic instances, we design a graph whose nodes correspond to instance proposal-level features and edges represent the relation between nodes. We achieve new state-of-the-art results on the COCO object detection task with diverse student-teacher pairs on both one- and two-stage detectors.

Installation

Requirements

Python >= 3.6
Pytorch >= 1.7.0
Torchvision >= 0.8.1
Pycocotools 2.0.2

Follow the install instructions in detectron2, note that in this repo we use detectron2 commit version ff638c931d5999f29c22c1d46a3023e67a5ae6a1. Download COCO dataset and export DETECTRON2_DATASETS=$COCOPATH to direct to COCO dataset. We prepare our pre-trained weights for training in Student-Teacher format, please follow the instructions in Pretrained.

Running

We prepare training configs following the detectron2 format. For training a Faster R-CNN R18-FPN student with a Faster R-CNN R50-FPN teacher on 4 GPUs:

./start_train.sh train projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

For testing:

./start_train.sh eval projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

For debugging:

./start_train.sh debugtrain projects/Distillation/configs/Distillation-FasterRCNN-R18-R50-dsig-1x.yaml

Results and Models

Faster R-CNN:

Experiment(Student-Teacher)	Schedule	AP	Config	Model
R18-R50	1x	37.25	config	googledrive
R50-R101	1x	40.57	config	googledrive
R101-R152	1x	41.65	config	googledrive
MNV2-R50	1x	34.44	config	googledrive
EB0-R101	1x	37.74	config	googledrive

RetinaNet:

Experiment(Student-Teacher)	Schedule	AP	Config	Model
R18-R50	1x	34.72	config	googledrive
MNV2-R50	1x	32.16	config	googledrive
EB0-R101	1x	34.44	config	googledrive

More models and results will be released soon.

Citation

@inproceedings{chen2021dsig,
    title={Deep Structured Instance Graph for Distilling Object Detectors},
    author={Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, and Jiaya Jia},
    booktitle={IEEE International Conference on Computer Vision (ICCV)},
    year={2021},
}

Contact

Please contact [email protected].

Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

Related tags

Overview

DSIG

Installation

Requirements

Running

Results and Models

Citation

Contact

Owner

DV Lab

Meandering In Networks of Entities to Reach Verisimilar Answers

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

一个目标检测的通用框架(不需要cuda编译)，支持Yolo全系列(v2~v5)、EfficientDet、RetinaNet、Cascade-RCNN等SOTA网络。

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swar.

Repository providing a wide range of self-supervised pretrained models for computer vision tasks.

Replication Package for "An Empirical Study of the Effectiveness of an Ensemble of Stand-alone Sentiment Detection Tools for Software Engineering Datasets"

TRACER: Extreme Attention Guided Salient Object Tracing Network implementation in PyTorch

An implementation of the AdaOPS (Adaptive Online Packing-based Search), which is an online POMDP Solver used to solve problems defined with the POMDPs.jl generative interface.

SiT: Self-supervised vIsion Transformer

Invert and perturb GAN images for test-time ensembling

Implementation of GGB color space

This repository contains the implementation of the paper: "Towards Frequency-Based Explanation for Robust CNN"

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

Learning to Segment Instances in Videos with Spatial Propagation Network

1st place solution in CCF BDCI 2021 ULSEG challenge

[NeurIPS 2021] Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

A repository for the paper "Improved Adversarial Systems for 3D Object Generation and Reconstruction".

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens