Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Last update: Dec 27, 2022

Related tags

Deep Learning PartialLabelingCSL

Overview

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Paper | Pretrained models

Official PyTorch Implementation

Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor
DAMO Academy, Alibaba Group

Abstract

Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un-annotated labels should be treated selectively according to two probability quantities: the class distribution in the overall dataset and the specific label likelihood for a given data sample. We propose to estimate the class distribution using a dedicated temporary model, and we show its improved efficiency over a naive estimation computed using the dataset's partial annotations. Second, during the training of the target model, we emphasize the contribution of annotated labels over originally un-annotated labels by using a dedicated asymmetric loss. Experiments conducted on three partially labeled datasets, OpenImages, LVIS, and simulated-COCO, demonstrate the effectiveness of our approach. Specifically, with our novel selective approach, we achieve state-of-the-art results on OpenImages dataset. Code will be made available.

Class-aware Selective Approach

An overview of our approach is summarized in the following figure:

Loss Implementation

Our loss consists of a selective approach for adjusting the training mode for each class individualy and a partial asymmetric loss.

An implementation of the Class-aware Selective Loss (CSL) can be found here.

class PartialSelectiveLoss(nn.Module)

Pretrained Models

We provide models pretrained on the OpenImages datasset with different modes and architectures:

Model	Architecture	Link	mAP
Ignore	TResNet-M	link	85.38
Negative	TResNet-M	link	85.85
Selective (CSL)	TResNet-M	link	86.72
Selective (CSL)	TResNet-L	link	87.34

Inference Code (Demo)

We provide inference code, that demonstrate how to load the model, pre-process an image and do inference. Example run of OpenImages model (after downloading the relevant model):

python infer.py  \
--dataset_type=OpenImages \
--model_name=tresnet_m \
--model_path=./models_local/mtresnet_opim_86.72.pth \
--pic_path=./pics/10162266293_c7634cbda9_o.jpg \
--input_size=448

Result Examples

Training Code

Training code is provided in (train.py). Also, code for simulating partial annotation for the MS-COCO dataset is available (here). In particular, two "partial" simulation schemes are implemented: fix-per-class(FPC) and random-per-sample (RPS).

FPC: For each class, we randomly sample a fixed number of positive annotations and the same number of negative annotations. The rest of the annotations are dropped.
RPA: We omit each annotation with probability p.

Pretrained weights using the ImageNet-21k dataset can be found here: link
Pretrained weights using the ImageNet-1k dataset can be found here: link

Example of training with RPS simulation:

--data=/mnt/datasets/COCO/COCO_2014
--model-path=models/pretrain/mtresnet_21k
--gamma_pos=0
--gamma_neg=4
--gamma_unann=4
--simulate_partial_type=rps
--simulate_partial_param=0.5
--partial_loss_mode=selective
--likelihood_topk=5
--prior_threshold=0.5
--prior_path=./outputs/priors/prior_fpc_1000.csv

Example of training with FPC simulation:

--data=/mnt/datasets/COCO/COCO_2014
--model-path=models/pretrain/mtresnet_21k
--gamma_pos=0
--gamma_neg=4
--gamma_unann=4
--simulate_partial_type=fpc
--simulate_partial_param=1000
--partial_loss_mode=selective
--likelihood_topk=5
--prior_threshold=0.5
--prior_path=./outputs/priors/prior_fpc_1000.csv

Typical Training Results

FPC (1,000) simulation scheme:

Model	mAP
Ignore, CE	76.46
Negative, CE	81.24
Negative, ASL (4,1)	81.64
CSL - Selective, P-ASL(4,3,1)	83.44

RPS (0.5) simulation scheme:

Model	mAP
Ignore, CE	84.90
Negative, CE	81.21
Negative, ASL (4,1)	81.91
CSL- Selective, P-ASL(4,1,1)	85.21

Estimating the Class Distribution

The training code contains also the procedure for estimting the class distribution from the data. Our approach enables to rank the classes based on training a temporary model usinig the Ignore mode. link

Top 10 classes:

Method	Top 10 ranked classes
Original	'person', 'chair', 'car', 'dining table', 'cup', 'bottle', 'bowl', 'handbag', 'truck', 'backpack'
Estiimate (Ignore mode)	'person', 'chair', 'handbag', 'cup', 'bench', 'bottle', 'backpack', 'car', 'cell phone', 'potted plant'
Estimate (Negative mode)	'kite' 'truck' 'carrot' 'baseball glove' 'tennis racket' 'remote' 'cat' 'tie' 'horse' 'boat'

Citation

@misc{benbaruch2021multilabel,
      title={Multi-label Classification with Partial Annotations using Class-aware Selective Loss}, 
      author={Emanuel Ben-Baruch and Tal Ridnik and Itamar Friedman and Avi Ben-Cohen and Nadav Zamir and Asaf Noy and Lihi Zelnik-Manor},
      year={2021},
      eprint={2110.10955},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledgements

Several images from OpenImages dataset are used in this project. ֿ
Some components of this code implementation are adapted from the repository https://github.com/Alibaba-MIIL/ASL.

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Related tags

Overview

Multi-label Classification with Partial Annotations using Class-aware Selective Loss

Class-aware Selective Approach

Loss Implementation

Pretrained Models

Inference Code (Demo)

Result Examples

Training Code

Typical Training Results

FPC (1,000) simulation scheme:

RPS (0.5) simulation scheme:

Estimating the Class Distribution

Top 10 classes:

Citation

Acknowledgements

Owner

The authors' official PyTorch SigWGAN implementation

Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021)

Course content and resources for the AIAIART course.

Setup freqtrade/freqUI on Heroku

Code for training and evaluation of the model from "Language Generation with Recurrent Generative Adversarial Networks without Pre-training"

Model-free Vehicle Tracking and State Estimation in Point Cloud Sequences

Doosan robotic arm, simulation, control, visualization in Gazebo and ROS2 for Reinforcement Learning.

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

PyTorch implementation of our method for adversarial attacks and defenses in hyperspectral image classification.

PyTorch implementation of SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Food Drinks and groceries Images Multi Lingual (FooDI-ML) dataset.

🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱

Dynamic Multi-scale Filters for Semantic Segmentation (DMNet ICCV'2019)

This is a repository of our model for weakly-supervised video dense anticipation.

Official PyTorch implementation of "Improving Face Recognition with Large AgeGaps by Learning to Distinguish Children" (BMVC 2021)

This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

Language-Agnostic Website Embedding and Classification