Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Last update: Dec 25, 2022

Related tags

Deep Learning NorCal

Overview

NorCal

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation.

Advances in Neural Information Processing Systems (NeurIPS), 2021.

Tai-Yu Pan*, Cheng Zhang*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao.

Introduction

Vanilla models for object detection and instance segmentation suffer from the heavy bias toward detecting frequent objects in the long-tailed setting. Existing methods address this issue mostly during training, e.g., by re-sampling or re-weighting.

In this paper, we investigate a largely overlooked approach -- post-processing calibration of confidence scores. We propose NorCal, Normalized Calibration for long-tailed object detection and instance segmentation, a simple and straightforward recipe that reweighs the predicted scores of each class by its training sample size. We show that separately handling the background class and normalizing the scores over classes for each proposal are keys to achieving superior performance. On the LVIS dataset, NorCal can effectively improve nearly all the baseline models not only on rare classes but also on common and frequent classes. Finally, we conduct extensive analysis and ablation studies to offer insights into various modeling choices and mechanisms of our approach.

Installation

Install Detectron2 following the instructions.

Evaluation

Model evaluation can be done similarly:

cd /path/to/detectron2/projects/NorCal
python train_net.py --config-file configs/lvis_v0.5_mask_rcnn_R_50_FPN.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint TEST.CALIBRATION.GAMMA gamma

Citation

Please cite with the following bibtex if you find it useful.

@inproceedings{pan2021norcal,
  title={On Model Calibration for Long-Tailed Object Detection and Instance Segmentation},
  author={Pan, Tai-Yu and Zhang, Cheng and Li, Yandong and Hu, Hexiang and Xuan, Dong and Changpinyo, Soravit and Gong, Boqing and Chao, Wei-Lun},
  booktitle = {NeurIPS},
  year={2021}
}

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Related tags

Overview

NorCal

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Introduction

Installation

Evaluation

Citation

Owner

Tai-Yu (Daniel) Pan

How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

This is a tensorflow-based rotation detection benchmark, also called AlphaRotate.

A full-fledged version of Pix2Seq

This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019.

This project generates news headlines using a Long Short-Term Memory (LSTM) neural network.

Effect of Different Encodings and Distance Functions on Quantum Instance-based Classifiers

Paper Code：A Self-adaptive Weighted Differential Evolution Approach for Large-scale Feature Selection

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

DeepMoCap: Deep Optical Motion Capture using multiple Depth Sensors and Retro-reflectors

Fiddle is a Python-first configuration library particularly well suited to ML applications.

This repo. is an implementation of ACFFNet, which is accepted for in Image and Vision Computing.

phylotorch-bito is a package providing an interface to BITO for phylotorch

TensorFlow implementation of the algorithm in the paper "Decoupled Low-light Image Enhancement"

A simple AI that will give you si ple task and this is made with python

PyTorch implementation of 'Gen-LaneNet: a generalized and scalable approach for 3D lane detection'

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks (MAPDN)

Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

A web porting for NVlabs' StyleGAN2, to facilitate exploring all kinds characteristic of StyleGAN networks