Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch

Last update: Dec 02, 2022

Overview

[AAAI 2021]DropLoss for Long-Tail Instance Segmentation

[AAAI 2021] DropLoss for Long-Tail Instance Segmentation
Ting-I Hsieh*, Esther Robb*, Hwann-Tzong Chen, Jia-Bin Huang.
Association for the Advancement of Artificial Intelligence (AAAI), 2021

Figure: Measuring the performance tradeoff. Comparison between rare, common, and frequent categories AP for baselines and our method. We visualize the tradeoff for ‘common vs. frequent’ and ‘rare vs. frequent’as a Pareto frontier, where the top-right position indicates an ideal tradeoff between objectives. DropLoss achieves an improved tradeoff between object categories, resulting in higher overall AP.

This project is a pytorch implementation of DropLoss for Long-Tail Instance Segmentation. DropLoss improves long-tail instance segmentation by adaptively removing discouraging gradients to infrequent classes. A majority of the code is modified from facebookresearch/detectron2 and tztztztztz/eql.detectron2.

Progress

Training code.
Evaluation code.
LVIS v1.0 datasets.
Provide checkpoint model.

Installation

Requirements

Linux or macOS with Python = 3.7
PyTorch = 1.4 and torchvision that matches the PyTorch installation. Install them together at pytorch.org to make sure of this
OpenCV (optional but needed for demos and visualization)

Build Detectron2 from Source

gcc & g++ ≥ 5 are required. ninja is recommended for faster build.

After installing them, run:

python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'
# (add --user if you don't have permission)

# Or, to install it from a local clone:
git clone https://github.com/facebookresearch/detectron2.git
python -m pip install -e detectron2


# Or if you are on macOS
CC=clang CXX=clang++ ARCHFLAGS="-arch x86_64" python -m pip install ......

Remove the latest fvcore package and install an older version:

pip uninstall fvcore
pip install fvcore==0.1.1.post200513

LVIS Dataset

Following the instructions of README.md to set up the LVIS dataset.

Training

To train a model with 8 GPUs run:

cd /path/to/detectron2/projects/DropLoss
python train_net.py --config-file configs/droploss_mask_rcnn_R_50_FPN_1x.yaml --num-gpus 8

Evaluation

Model evaluation can be done similarly:

cd /path/to/detectron2/projects/DropLoss
python train_net.py --config-file configs/droploss_mask_rcnn_R_50_FPN_1x.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint

Citing DropLoss

If you use DropLoss, please use the following BibTeX entry.

@inproceedings{DBLP:conf/aaai/Ting21,
  author 	= {Hsieh, Ting-I and Esther Robb and Chen, Hwann-Tzong and Huang, Jia-Bin},
  title     = {DropLoss for Long-Tail Instance Segmentation},
  booktitle = {Proceedings of the Workshop on Artificial Intelligence Safety 2021
               (SafeAI 2021) co-located with the Thirty-Fifth {AAAI} Conference on
               Artificial Intelligence {(AAAI} 2021), Virtual, February 8, 2021},
  year      = {2021}
  }

Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch

Related tags

Overview

[AAAI 2021]DropLoss for Long-Tail Instance Segmentation

Progress

Installation

Requirements

Build Detectron2 from Source

LVIS Dataset

Training

Evaluation

Citing DropLoss

Owner

Tim

The datasets and code of ACL 2021 paper "Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions".

Official PyTorch Implementation of Mask-aware IoU and maYOLACT Detector [BMVC2021]

DeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.

MPI Interest Group on Algorithms on 1st semester 2021

Python script to download the celebA-HQ dataset from google drive

For IBM Quantum Challenge Africa 2021, 9 September (07:00 UTC) - 20 September (23:00 UTC).

TransVTSpotter: End-to-end Video Text Spotter with Transformer

A framework for the elicitation, specification, formalization and understanding of requirements.

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

[AAAI 2022] Separate Contrastive Learning for Organs-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

Supervised & unsupervised machine-learning techniques are applied to the database of weighted P4s which admit Calabi-Yau hypersurfaces.

PyTorch implementation of federated learning framework based on the acceleration of global momentum

MANO hand model porting for the GraspIt simulator

Medical Insurance Cost Prediction using Machine earning

Generative Modelling of BRDF Textures from Flash Images [SIGGRAPH Asia, 2021]

VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning

Code for paper "Multi-level Disentanglement Graph Neural Network"

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

Multi-task head pose estimation in-the-wild