Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Last update: Jan 06, 2023

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

This repository is the official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling, ICCV 2021.

The proposed method is implemented based on the SSD pytorch.

Our approach relies on mixture density networks to estimate, in a single forward pass of a single model, both localization and classification uncertainties, and leverages them in the scoring function for active learning.

Our method performs on par with multiple model-based methods (e.g., ensembles and MC-Dropout). Therefore, our method provides the best trade-off between accuracy and computational cost.

License

To view a NVIDIA Source Code License for this work, visit https://github.com/NVlabs/AL-MDN/blob/main/LICENSE

Requirements

For setup and data preparation, please refer to the README in SSD pytorch.

Code was tested in virtual environment with Python 3+ and Pytorch 1.1.

Training

Make directory mkdir weights and cd weights.
Download the FC-reduced VGG-16 backbone weight in the weights directory, and cd ...
If necessary, change the VOC_ROOT in data/voc0712.py or COCO_ROOT in data/coco.py.
Please refer to data/config.py for configuration.
Run the training code:

# Supervised learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_supervised_learning.py

# Active learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_active_learining.py

Evaluation

To evaluate on MS-COCO, change the COCO_ROOT_EVAL in data/coco_eval.py.
Run the evaluation code:

# Evaluation on PASCAL VOC
python eval_voc.py --trained_model <trained weight path>

# Evaluation on MS-COCO
python eval_coco.py --trained_model <trained weight path>

Visualization

Run the visualization code:

python demo.py --trained_model <trained weight path>

Citation

@InProceedings{Choi_2021_ICCV,
    author    = {Choi, Jiwoong and Elezi, Ismail and Lee, Hyuk-Jae and Farabet, Clement and Alvarez, Jose M.},
    title     = {Active Learning for Deep Object Detection via Probabilistic Modeling},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10264-10273}
}

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Related tags

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

License

Requirements

Training

Evaluation

Visualization

Citation

Owner

NVIDIA Research Projects

Drone Task1 - Drone Task1 With Python

RuleBERT: Teaching Soft Rules to Pre-Trained Language Models

The "breathing k-means" algorithm with datasets and example notebooks

Code for "Optimizing risk-based breast cancer screening policies with reinforcement learning"

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

Bottleneck Transformers for Visual Recognition

Wordle Env: A Daily Word Environment for Reinforcement Learning

Pytorch implementation for "Density-aware Chamfer Distance as a Comprehensive Metric for Point Cloud Completion" (NeurIPS 2021)

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

Price-Prediction-For-a-Dream-Home - A machine learning based linear regression trained model for house price prediction.

Intrusion Detection System using ensemble learning (machine learning)

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

TransMorph: Transformer for Medical Image Registration

code release for USENIX'22 paper `On the Security Risks of AutoML`

This repository contains code, network definitions and pre-trained models for working on remote sensing images using deep learning

Non-Imaging Transient Reconstruction And TEmporal Search (NITRATES)

Using BERT+Bi-LSTM+CRF

DeiT: Data-efficient Image Transformers

Attack on Confidence Estimation algorithm from the paper "Disrupting Deep Uncertainty Estimation Without Harming Accuracy"