Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

MarcoPolo is a clustering-free approach to the exploration of bimodally expressed genes along with group information in single-cell RNA-seq data

Indonesian Car License Plate Character Recognition using Tensorflow, Keras and OpenCV.

A novel framework to automatically learn high-quality scanning of non-planar, complex anisotropic appearance.

U-Net for GBM

Annotate datasets with a semi-trained or fully trained YOLOv5 model

Ranger deep learning optimizer rewrite to use newest components

Single-stage Keypoint-based Category-level Object Pose Estimation from an RGB Image

Hand Gesture Volume Control | Open CV | Computer Vision

Reimplementation of Dynamic Multi-scale filters for Semantic Segmentation.

Faster RCNN with PyTorch

Freecodecamp Scientific Computing with Python Certification; Solution for Challenge 2: Time Calculator

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Zero-Cost Proxies for Lightweight NAS

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Learning from Synthetic Shadows for Shadow Detection and Removal [Inoue+, IEEE TCSVT 2020].

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

PyTorch GPU implementation of the ES-RNN model for time series forecasting

A PyTorch implementation of a Factorization Machine module in cython.

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

A embed able annotation tool for end to end cross document co-reference