4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Last update: Nov 09, 2022

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR)

Challenge Site

Overview

Synthetic Aperture Radar (SAR) has received more attention due to its complementary superiority on capturing significant information in the remote sensing area. However, for an Aerial View Object Classification (AVOC) task, SAR images still suffer from the long-tailed distribution of the aerial view objects. This disparity dampens the performance of classification methods, especially for the datasensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into the representation learning stage and the classification learning stage. Moreover, we apply the test time augmentation (TTA) and a post-processing approach (CAN) to improve the accuracy. In the PBVS 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase respectively, which achieves the top-tier among all the participants.

Requirements

Ubuntu (It's only tested on Ubuntu, so it may not work on Windows.)
Python >= 3.7
PyTorch >= 1.4.0
torchvision
```
pip install -r requirements.txt
```

Usage

The first stage training

python train.py --config ./configs/sar10/shake_shake.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val”, under the “dataset” field and “output_dir” under the “train” field in the file “./configs/sar10/shake_shake.yaml”。

The second stage training

python train.py --config ./configs/sar10/shake_shake_fc.yaml

You need to change the value of “dataset_dir”, “dataset_dir_val” under the “dataset” field and “output_dir”, “checkpoint” under the “train” field in the file “./configs/sar10/shake_shake_fc.yaml”。

Test

python predict_TTA.py

You need to change the value of “dataset_dir”, “checkpoint”, under the “test” field in the file “./configs/sar10/shake_shake.yaml”, then you can find the results in file “.result/results.csv”。
You can download the trained model here.

Acknowledge

The codes borrow heavily from hysts/pytorch_image_classification.

4st place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Related tags

Overview

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

Overview

Requirements

Usage

The first stage training

The second stage training

Test

Acknowledge

Owner

LinpengPan

Composing methods for ML training efficiency

(CVPR 2022) Energy-based Latent Aligner for Incremental Learning

Training deep models using anime, illustration images.

This repository contains the map content ontology used in narrative cartography

PyTorch implementation of SQN based on CloserLook3D's encoder

Modular Probabilistic Programming on MXNet

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

Repo for the paper Extrapolating from a Single Image to a Thousand Classes using Distillation

Download files from DSpace systems (because for some reason DSpace won't let you)

To prepare an image processing model to classify the type of disaster based on the image dataset

Official repository for "Orthogonal Projection Loss" (ICCV'21)

Deep Learning Interviews book: Hundreds of fully solved job interview questions from a wide range of key topics in AI.

Evolving neural network parameters in JAX.

Code for CVPR2021 paper 'Where and What? Examining Interpretable Disentangled Representations'.

Unsupervised Video Interpolation using Cycle Consistency

Multi-Glimpse Network With Python

Code for GNMR in ICDE 2021

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Using multidimensional LSTM neural networks to create a forecast for Bitcoin price

PyTorch package for the discrete VAE used for DALL·E.