TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

Last update: Dec 21, 2022

Related tags

Deep Learning FairMOT-attack

Overview

TraSw for FairMOT

A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

Fig.1 Original

Fig.2 Attacked

By perturbing only two frames in this example video, we can exchange the 19th ID and the 24th ID completely. Starting from frame 592, the 19th and 24th IDs can keep the exchange without noise.

TraSw: Tracklet-Switch Adversarial Attacks against Multi-Object Tracking,
Delv Lin, Qi Chen, Chengyu Zhou, Kun He,
arXiv 2111.08954

Related Works

TraSw for ByteTrack

Abstract

Benefiting from the development of Deep Neural Networks, Multi-Object Tracking (MOT) has achieved aggressive progress. Currently, the real-time Joint-Detection-Tracking (JDT) based MOT trackers gain increasing attention and derive many excellent models. However, the robustness of JDT trackers is rarely studied, and it is challenging to attack the MOT system since its mature association algorithms are designed to be robust against errors during tracking. In this work, we analyze the weakness of JDT trackers and propose a novel adversarial attack method, called Tracklet-Switch (TraSw), against the complete tracking pipeline of MOT. Specifically, a push-pull loss and a center leaping optimization are designed to generate adversarial examples for both re-ID feature and object detection. TraSw can fool the tracker to fail to track the targets in the subsequent frames by attacking very few frames. We evaluate our method on the advanced deep trackers (i.e., FairMOT, JDE, ByteTrack) using the MOT-Challenge datasets (i.e., 2DMOT15, MOT17, and MOT20). Experiments show that TraSw can achieve a high success rate of over 95% by attacking only five frames on average for the single-target attack and a reasonably high success rate of over 80% for the multiple-target attack.

Attack Performance

Single-Target Attack Results on MOT challenge test set

Dataset	Suc. Rate	Avg. Frames	Avg. L₂ Distance
2DMOT15	95.37%	4.67	3.55
MOT17	96.35%	5.61	3.23
MOT20	98.89%	4.12	3.12

Multiple-Target Attack Results on MOT challenge test set

Dataset	Suc. Rate	Avg. Frames (Proportion)	Avg. L₂ Distance
2DMOT15	81.95%	35.06%	2.79
MOT17	82.01%	38.85%	2.71
MOT20	82.02%	54.35%	3.28

Installation

same as FairMOT
Clone this repo, and we'll call the directory that you cloned as ${FA_ROOT}
Install dependencies. We use python 3.7 and pytorch >= 1.2.0

conda create -n FA
conda activate FA
conda install pytorch==1.2.0 torchvision==0.4.0 cudatoolkit=10.0 -c pytorch
cd ${FA_ROOT}
pip install -r requirements.txt
cd src/lib/models/networks/DCNv2 sh make.sh

We use DCNv2 in our backbone network and more details can be found in their repo.
In order to run the code for demos, you also need to install ffmpeg.

Data preparation

We only use the same test data as FairMOT.

2DMOT15, MOT17 and MOT20 can be downloaded from the official webpage of MOT-Challenge. After downloading, you should prepare the data in the following structure:

${DATA_DIR}
    ├── MOT15
    │   └── images
    │       ├── test
    │       └── train
    ├── MOT17
    │   └── images
    │       ├── test
    │       └── train
    └── MOT20
        └── images
            ├── test
            └── train

Target Model

We choose DLA-34: [Google] [Baidu, code: 88yn] trained by FairMOT as our primary target model.

Tracking without Attack

tracking on original videos of 2DMOT15, MOT17, and MOT20

cd src
python track.py mot --test_mot15 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR}
python track.py mot --test_mot17 True --load_model all_dla34.pth --conf_thres 0.4 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR}
python track.py mot --test_mot20 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR}

Attack

Single-Target Attack

attack all attackable objects separately in videos in parallel (may require a lot of memory).

cd src
python track.py mot --test_mot15 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id -1
python track.py mot --test_mot17 True --load_model all_dla34.pth --conf_thres 0.4 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id -1
python track.py mot --test_mot20 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id -1

attack a specific object in a specific video (require to set specific video in src/track.py).

cd src
python track.py mot --test_mot15 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id ${a specific id in origial tracklets}
python track.py mot --test_mot17 True --load_model all_dla34.pth --conf_thres 0.4 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id ${a specific id in origial tracklets}
python track.py mot --test_mot20 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack single --attack_id ${a specific id in origial tracklets}

Multiple-Targets Attack

attack all attackable objects in videos.

cd src
python track.py mot --test_mot15 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack multiple
python track.py mot --test_mot17 True --load_model all_dla34.pth --conf_thres 0.4 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack multiple
python track.py mot --test_mot20 True --load_model all_dla34.pth --conf_thres 0.3 --data_dir ${DATA_DIR} --output_dir ${OUTPUT_DIR} --attack multiple

Acknowledgement

This source code is based on FairMOT. Thanks for their wonderful works.

Citation

@misc{lin2021trasw,
      title={TraSw: Tracklet-Switch Adversarial Attacks against Multi-Object Tracking}, 
      author={Delv Lin and Qi Chen and Chengyu Zhou and Kun He},
      year={2021},
      eprint={2111.08954},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

Related tags

Overview

TraSw for FairMOT

Abstract

Attack Performance

Installation

Data preparation

Target Model

Tracking without Attack

Attack

Single-Target Attack

Multiple-Targets Attack

Acknowledgement

Citation

Owner

Derry Lin

Open-L2O: A Comprehensive and Reproducible Benchmark for Learning to Optimize Algorithms

A repo with study material, exercises, examples, etc for Devnet SPAUTO

Jupyter notebooks for the code samples of the book "Deep Learning with Python"

JupyterNotebook - C/C++, Javascript, HTML, LaTex, Shell scripts in Jupyter Notebook Also run them on remote computer

Code of paper Interact, Embed, and EnlargE (IEEE): Boosting Modality-specific Representations for Multi-Modal Person Re-identification.

A Python framework for developing parallelized Computational Fluid Dynamics software to solve the hyperbolic 2D Euler equations on distributed, multi-block structured grids.

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

LowRankModels.jl is a julia package for modeling and fitting generalized low rank models.

This is the second place solution for : UmojaHack Africa 2022: African Snake Antivenom Binding Challenge

Repository to run object detection on a model trained on an autonomous driving dataset.

Caffe: a fast open framework for deep learning.

Differentiable architecture search for convolutional and recurrent networks

Deep Sea Treasure Environment for Multi-Objective Optimization Research

Code for testing convergence rates of Lipschitz learning on graphs

A Re-implementation of the paper "A Deep Learning Framework for Character Motion Synthesis and Editing"

PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training”

Pytorch based library to rank predicted bounding boxes using text/image user's prompts.

Pytorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)

Spatial Contrastive Learning for Few-Shot Classification (SCL)

Cross-media Structured Common Space for Multimedia Event Extraction (ACL2020)