Official implementation of "Dynamic Anchor Learning for Arbitrary-Oriented Object Detection" (AAAI 2021).

Overview

DAL

This project hosts the official implementation for our AAAI 2021 paper:

Dynamic Anchor Learning for Arbitrary-Oriented Object Detection [arxiv] [comments].

Abstract

In this paper, we propose a dynamic anchor learning (DAL) method, which utilizes the newly defined matching degree to comprehensively evaluate the localization potential of the anchors and carries out a more efficient label assignment process. In this way, the detector can dynamically select high-quality anchors to achieve accurate object detection, and the divergence between classification and regression is alleviated.
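
As a rough illustration of the idea (a minimal sketch, not the paper's exact formulation: the alpha/gamma weights and the selection threshold below are hypothetical), a matching-degree style score can blend the anchor's prior IoU with the ground truth and the IoU of its regressed box, penalizing disagreement between the two:

# Minimal sketch of a matching-degree style score, assuming it combines the
# anchor's prior IoU (spatial alignment) with the regressed box's IoU
# (feature alignment) and penalizes their disagreement. alpha, gamma and the
# 0.4 threshold are hypothetical values, not the paper's settings.
import torch

def matching_degree(prior_iou: torch.Tensor,
                    regressed_iou: torch.Tensor,
                    alpha: float = 0.5,
                    gamma: float = 2.0) -> torch.Tensor:
    disagreement = (prior_iou - regressed_iou).abs()
    return alpha * prior_iou + (1.0 - alpha) * regressed_iou - disagreement.pow(gamma)

# Anchors scoring above a threshold would be assigned as positives during
# label assignment; a low-IoU anchor that regresses well can still be kept.
scores = matching_degree(torch.tensor([0.65, 0.30, 0.55]),
                         torch.tensor([0.70, 0.80, 0.20]))
positives = scores > 0.4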

Getting Started

The code builds a rotated RetinaNet detector with the proposed DAL method for oriented object detection. The supported datasets include DOTA, HRSC2016, ICDAR2013, ICDAR2015, UCAS-AOD, NWPU VHR-10, and VOC.

Installation

Install requirements:

pip install -r requirements.txt
pip install git+git://github.com/lehduong/torch-warmup-lr.git

Build the Cython and CUDA modules:

cd $ROOT/utils
sh make.sh
cd $ROOT/utils/overlaps_cuda
python setup.py build_ext --inplace

Installation for DOTA_devkit:

cd $ROOT/datasets/DOTA_devkit
sudo apt-get install swig
swig -c++ -python polyiou.i
python setup.py build_ext --inplace
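
Assuming the build succeeds, the compiled SWIG module can be imported directly from Python. A quick sanity check along the following lines may help (the iou_poly / VectorDouble names follow the usual DOTA_devkit wrapper and should be verified against your local build):

# Sanity check for the compiled polygon-IoU extension. The names below
# (iou_poly, VectorDouble) follow the common DOTA_devkit SWIG wrapper;
# verify them against your build if the import or call fails.
import polyiou

# Two unit squares offset by 0.5 along x; the expected IoU is 1/3.
p = polyiou.VectorDouble([0.0, 0.0, 1.0, 0.0, 1.0, 1.0, 0.0, 1.0])
q = polyiou.VectorDouble([0.5, 0.0, 1.5, 0.0, 1.5, 1.0, 0.5, 1.0])
print(polyiou.iou_poly(p, q))  # should print roughly 0.333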

Inference

You can use the following command to test a dataset. Note that weight, img_dir, dataset, and hyp should be modified as appropriate; a hedged example follows the command below.

python demo.py
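
For reference, the settings might look like the following sketch (all paths, the dataset name, and even the variable names are placeholders; check demo.py for the actual argument names):

# Hypothetical values for the settings named above; the actual variable or
# argument names live in demo.py and may differ from this sketch.
weight  = 'weights/dal_hrsc2016.pth'    # path to a trained checkpoint (placeholder)
img_dir = 'HRSC2016/Test/AllImages'     # directory of test images (placeholder)
dataset = 'HRSC2016'                    # one of the supported dataset names
hyp     = 'hyp.py'                      # hyperparameter/configuration file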

Train

  1. Move the dataset to the $ROOT directory.
  2. Generate imageset files for dataset division (a hedged sketch of the resulting imageset file follows this list) via:
cd $ROOT/datasets
python generate_imageset.py
  3. Modify the configuration file hyp.py and the arguments in train.py, then start training:
python train.py
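
For reference, an imageset file is typically a plain-text list with one image identifier per line; a quick check might look like this (the path below is purely illustrative, and the actual location depends on the dataset and on generate_imageset.py):

# Hypothetical check of a generated imageset file; the path is an example
# only and the real location depends on the dataset and generate_imageset.py.
with open('datasets/HRSC2016/ImageSets/trainval.txt') as f:
    image_ids = [line.strip() for line in f if line.strip()]
print(len(image_ids), 'images, e.g.', image_ids[:3])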

Evaluation

Different datasets use different evaluation protocols. For UCAS-AOD/HRSC2016/VOC/NWPU VHR-10, you need to prepare ground-truth labels in the required format in advance. Take evaluation on HRSC2016 as an example:

cd $ROOT/datasets/evaluate
python hrsc2gt.py

then you can conduct evaluation:

python eval.py

Note that:

  • the conversion script only needs to be executed once per dataset; run the corresponding script again before testing on a different dataset.
  • the imageset file used in hrsc2gt.py is the one generated by generate_imageset.py.

Main Results

Method | Dataset     | Bbox | Backbone   | Input Size | mAP/F1
DAL    | DOTA        | OBB  | ResNet-101 | 800 x 800  | 71.78
DAL    | UCAS-AOD    | OBB  | ResNet-101 | 800 x 800  | 89.87
DAL    | HRSC2016    | OBB  | ResNet-50  | 416 x 416  | 88.60
DAL    | ICDAR2015   | OBB  | ResNet-101 | 800 x 800  | 82.4
DAL    | ICDAR2013   | HBB  | ResNet-101 | 800 x 800  | 81.3
DAL    | NWPU VHR-10 | HBB  | ResNet-101 | 800 x 800  | 88.3
DAL    | VOC 2007    | HBB  | ResNet-101 | 800 x 800  | 76.1

Detections

DOTA_results

Citation

If you find our work or code useful in your research, please consider citing:

@article{ming2020dynamic,
  title={Dynamic Anchor Learning for Arbitrary-Oriented Object Detection},
  author={Ming, Qi and Zhou, Zhiqiang and Miao, Lingjuan and Zhang, Hongwei and Li, Linhao},
  journal={arXiv preprint arXiv:2012.04150},
  year={2020}
}

If you have any questions, please contact me via issue or email.
