SCALoss: Side and Corner Aligned Loss for Bounding Box Regression (AAAI2022).

Last update: Sep 07, 2022

Overview

SCALoss

PyTorch implementation of the paper "SCALoss: Side and Corner Aligned Loss for Bounding Box Regression" (AAAI 2022).

Introduction

IoU-based losses suffer from vanishing gradients when the predicted and ground-truth bounding boxes overlap only slightly, which slows convergence.
Side Overlap penalizes low-overlap cases more heavily, and Corner Distance speeds up convergence.
SCALoss combines Side Overlap and Corner Distance into a comprehensive similarity measure, leading to better localization performance and faster convergence.
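To make the motivation concrete, here is a minimal pure-Python sketch, not the repository's implementation. The function names, the per-axis normalization by the enclosing extent, the 0.5 averaging factor, and the weight `alpha=0.5` are all assumptions chosen for illustration; consult the paper and the training code for the exact formulation. It compares two predictions that do not touch the ground-truth box: plain IoU is zero for both, while the side-overlap and corner-distance terms still distinguish the nearer box from the farther one.

```python
EPS = 1e-9  # guards against division by zero for degenerate boxes


def iou(a, b):
    """Plain IoU of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / (union + EPS)


def side_overlap(a, b):
    """Assumed form: signed per-axis overlap over the enclosing extent.

    The overlap is NOT clamped at zero, so it keeps decreasing as
    disjoint boxes move further apart.
    """
    w_min = min(a[2], b[2]) - max(a[0], b[0])  # negative when disjoint in x
    w_max = max(a[2], b[2]) - min(a[0], b[0])
    h_min = min(a[3], b[3]) - max(a[1], b[1])  # negative when disjoint in y
    h_max = max(a[3], b[3]) - min(a[1], b[1])
    return 0.5 * (w_min / (w_max + EPS) + h_min / (h_max + EPS))


def corner_distance(a, b):
    """Assumed form: squared corner distances over the squared enclosing diagonal."""
    d_lt = (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2  # top-left corners
    d_rb = (a[2] - b[2]) ** 2 + (a[3] - b[3]) ** 2  # bottom-right corners
    w_max = max(a[2], b[2]) - min(a[0], b[0])
    h_max = max(a[3], b[3]) - min(a[1], b[1])
    return (d_lt + d_rb) / (w_max ** 2 + h_max ** 2 + EPS)


def sca_loss(a, b, alpha=0.5):
    """Illustrative combination; alpha is a hypothetical weight."""
    return (1.0 - side_overlap(a, b)) + alpha * corner_distance(a, b)


gt = (0.0, 0.0, 10.0, 10.0)
near = (20.0, 0.0, 30.0, 10.0)  # disjoint from gt, 10 px away
far = (40.0, 0.0, 50.0, 10.0)   # disjoint from gt, 30 px away

for name, box in (("near", near), ("far", far)):
    print(name, iou(gt, box), round(side_overlap(gt, box), 3), round(sca_loss(gt, box), 3))
```

In this sketch both disjoint boxes get an IoU of zero, so an IoU loss cannot tell them apart, whereas the illustrative SCA value is lower for the nearer box.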

Prerequisites

Python>=3.6.0
PyTorch>=1.7
Other dependencies described in requirements.txt

Install

Conda is not required for installation, but the steps below use it.

$ conda create -n sca-yolo python=3.8 -y
$ conda activate sca-yolo
$ git clone https://github.com/Turoad/SCALoss
$ cd SCALoss
$ pip install -r requirements.txt

Getting started

Train a model:

python train.py --data [dataset config] --cfg [model config] --weights [path of pretrain weights] --batch-size [batch size num]

For example, to train yolov3-tiny on the COCO dataset from scratch with a batch size of 128:

python train.py --data coco.yaml --cfg yolov3-tiny.yaml --weights '' --batch-size 128

For multi-GPU training, it is recommended to use:

python -m torch.distributed.launch --nproc_per_node 4 train.py --img 640 --batch 32 --epochs 300 --data coco.yaml --weights '' --cfg yolov3.yaml --device 0,1,2,3

Test a model:

python val.py --data coco.yaml --weights runs/train/exp15/weights/last.pt --img 640 --iou-thres=0.65

Results and Checkpoints

YOLOv3-tiny

Each cell shows the AP followed by the relative improvement (%) over the IoU baseline.

Loss	mAP 0.5:0.95	AP 0.5	AP 0.65	AP 0.75	AP 0.8	AP 0.9
IoU	18.8	36.2	27.2	17.3	11.6	1.9
GIoU	18.8 (0%)	36.2 (0%)	27.1 (-0.37%)	17.6 (1.73%)	11.8 (1.72%)	2.1 (10.53%)
DIoU	18.8 (0%)	36.4 (0.55%)	26.9 (-1.1%)	17.2 (-0.58%)	11.8 (1.72%)	1.9 (0%)
CIoU	18.9 (0.53%)	36.6 (1.1%)	27.3 (0.37%)	17.2 (-0.58%)	11.6 (0%)	2.1 (10.53%)
SCA	19.9 (5.85%)	36.6 (1.1%)	28.3 (4.04%)	19.1 (10.4%)	13.3 (14.66%)	2.7 (42.11%)
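The relative-improvement percentages in the table above appear to be computed against the IoU row. The short sketch below reproduces a few of the SCA entries from the reported AP values; the function name `relative_improvement` is hypothetical and not part of the repository.

```python
def relative_improvement(baseline, value):
    """Percentage change of `value` relative to `baseline`."""
    return (value - baseline) / baseline * 100.0


# AP values copied from the YOLOv3-tiny table (IoU baseline vs. SCA).
iou_tiny = {"mAP 0.5:0.95": 18.8, "AP 0.75": 17.3, "AP 0.9": 1.9}
sca_tiny = {"mAP 0.5:0.95": 19.9, "AP 0.75": 19.1, "AP 0.9": 2.7}

for metric, base in iou_tiny.items():
    gain = relative_improvement(base, sca_tiny[metric])
    print(f"{metric}: {gain:.2f}%")
```

The gain grows with the IoU threshold, which matches the claim that SCALoss mainly improves localization quality.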

The convergence curves of different losses on YOLOv3-tiny:

YOLOv3

Each cell shows the AP followed by the relative improvement (%) over the IoU baseline.

Loss	mAP 0.5:0.95	AP 0.5	AP 0.65	AP 0.75	AP 0.8	AP 0.9
IoU	44.8	64.2	57.5	48.8	41.8	20.7
GIoU	44.7 (-0.22%)	64.4 (0.31%)	57.5 (0%)	48.5 (-0.61%)	42 (0.48%)	20.4 (-1.45%)
DIoU	44.7 (-0.22%)	64.3 (0.16%)	57.5 (0%)	48.9 (0.2%)	42.1 (0.72%)	19.8 (-4.35%)
CIoU	44.7 (-0.22%)	64.3 (0.16%)	57.5 (0%)	48.9 (0.2%)	41.7 (-0.24%)	19.8 (-4.35%)
SCA	45.3 (1.12%)	64.1 (-0.16%)	57.9 (0.7%)	49.9 (2.25%)	43.3 (3.59%)	21.4 (3.38%)

YOLOv5s

Coming soon.

Citation

If our paper and code are beneficial to your work, please consider citing:

@inproceedings{zheng2022scaloss,
  title={SCALoss: Side and Corner Aligned Loss for Bounding Box Regression},
  author={Zheng, Tu and Zhao, Shuai and Liu, Yang and Liu, Zili and Cai, Deng},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  year={2022}
}

Acknowledgement

The code is modified from ultralytics/yolov3.

Releases(models)

models(Apr 28, 2022)

Owner

TuZheng
