Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Last update: Dec 21, 2022

Related tags

Overview

Zero-Shot Domain Adaptation with a Physics Prior

[arXiv] [sup. material] - ICCV 2021 Oral paper, by Attila Lengyel, Sourav Garg, Michael Milford and Jan van Gemert.

This repository contains the PyTorch implementation of Color Invariant Convolutions and all experiments and datasets described in the paper.

Abstract

We explore the zero-shot setting for day-night domain adaptation. The traditional domain adaptation setting is to train on one domain and adapt to the target domain by exploiting unlabeled data samples from the test set. As gathering relevant test data is expensive and sometimes even impossible, we remove any reliance on test data imagery and instead exploit a visual inductive prior derived from physics-based reflection models for domain adaptation. We cast a number of color invariant edge detectors as trainable layers in a convolutional neural network and evaluate their robustness to illumination changes. We show that the color invariant layer reduces the day-night distribution shift in feature map activations throughout the network. We demonstrate improved performance for zero-shot day to night domain adaptation on both synthetic as well as natural datasets in various tasks, including classification, segmentation and place recognition.

Getting started

All code and experiments have been tested with PyTorch 1.7.0.

Create a local clone of this repository:

git clone https://github.com/Attila94/CIConv

The method directory contains the color invariant convolution (CIConv) layer, as well as custom ResNet and VGG models using the CIConv layer. To use the CIConv layer in your own architecture, simply copy ciconv2d.py to the desired directory and add it as a regular PyTorch layer as

from ciconv2d import CIConv2d
ciconv = CIConv2d('W', k=3, scale=0.0)

See resnet.py and vgg.py for examples.

Datasets

Shapenet Illuminants

[Download link]

Shapenet Illuminants is used in the synthetic classification experiment. The images are rendered from a subset of the ShapeNet dataset using the physically based renderer Mitsuba. The scene is illuminated by a point light modeled as a black-body radiator with temperatures ranging between [1900, 20000] K and an ambient light source. The training set contains 1,000 samples for each of the 10 object classes recorded under "normal" lighting conditions (T = 6500 K). Multiple test sets with 300 samples per class are rendered for a variety of light source intensities and colors.

Common Objects Day and Night

[Download link]

Common Objects Day and Night (CODaN) is a natural day-night image classification dataset. More information can be found on the separate Github repository: https://github.com/Attila94/CODaN.

Experiments

1. Synthetic classification

Download [link] and unpack the Shapenet Illuminants dataset.
In your local CIConv clone navigate to experiments/1_synthetic_classification and run

python train.py --root 'path/to/shapenet_illuminants' --hflip --seed 0 --invariant 'W'

This will train a ResNet-18 with the 'W' color invariant from scratch and evaluate on all test sets.

Classification accuracy of ResNet-18 with various color invariants. RGB (not invariant) performance degrades when illumination conditions differ between train and test set, while color invariants remain more stable. W performs best overall.

2. CODaN classification

Download the Common Objects Day and Night (CODaN) dataset from https://github.com/Attila94/CODaN.
In your local CIConv clone navigate to experiments/2_codan_classification and run

python train.py --root 'path/to/codan' --invariant 'W' --scale 0. --hflip --jitter 0.3 --rr 20 --seed 0

This will train a ResNet-18 with the 'W' color invariant from scratch and evaluate on all test sets.

Selected results from the paper:

Method	Day (% accuracy)	Night (% accuracy)
Baseline	80.39 +- 0.38	48.31 +- 1.33
E	79.79 +- 0.40	49.95 +- 1.60
W	81.49 +- 0.49	59.67 +- 0.93
C	78.04 +- 1.08	53.44 +- 1.28
N	77.44 +- 0.00	52.03 +- 0.27
H	75.20 +- 0.56	50.52 +- 1.34

3. Semantic segmentation

Download and unpack the following public datasets: Cityscapes, Nighttime Driving, Dark Zurich.
In your local CIConv clone navigate to experiments/3_segmentation.
Set the proper dataset locations in train.py.

Run

python train.py --hflip --rc --jitter 0.3 --scale 0.3 --batch-size 6 --pretrained --invariant 'W'

Selected results from the paper:

Method	Nighttime Driving (mIoU)	Dark Zurich (mIoU)
RefineNet [baseline]	34.1	30.6
W-RefineNet [ours]	41.6	34.5

4. Visual place recognition

Setup conda environment

conda create -n ciconv python=3.9 mamba -c conda-forge
conda activate ciconv
mamba install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=10.1 scikit-image -c pytorch

Navigate to experiments/4_visual_place_recognition/cnnimageretrieval-pytorch/.

Run

git submodule update --init # download a fork of cnnimageretrieval-pytorch
sh cirtorch/utils/setup_tests.sh # download datasets and pre-trained models 
python3 -m cirtorch.examples.test --network-path data/networks/retrieval-SfM-120k_w_resnet101_gem/model.path.tar --multiscale '[1, 1/2**(1/2), 1/2]' --datasets '247tokyo1k' --whitening 'retrieval-SfM-120k'

Use --network-path retrievalSfM120k-resnet101-gem to compare against the vanilla method (without using the color invariant trained ResNet101).
Use --datasets 'gp_dl_nr' to test on the GardensPointWalking dataset.

Selected results from the paper:

Method	Tokyo 24/7 (mAP)
ResNet101 GeM [baseline]	85.0
W-ResNet101 GeM [ours]	88.3

Citation

If you find this repository useful for your work, please cite as follows:

@article{lengyel2021zeroshot,
      title={Zero-Shot Domain Adaptation with a Physics Prior}, 
      author={Attila Lengyel and Sourav Garg and Michael Milford and Jan C. van Gemert},
      year={2021},
      eprint={2108.05137},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Original code for "Zero-Shot Domain Adaptation with a Physics Prior"

Related tags

Overview

Zero-Shot Domain Adaptation with a Physics Prior

Abstract

Getting started

Datasets

Shapenet Illuminants

Common Objects Day and Night

Experiments

1. Synthetic classification

2. CODaN classification

3. Semantic segmentation

4. Visual place recognition

Citation

Owner

Attila Lengyel

Mask2Former: Masked-attention Mask Transformer for Universal Image Segmentation in TensorFlow 2

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

Apply a perspective transformation to a raster image inside Inkscape (no need to use an external software such as GIMP or Krita).

Code for "ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on", accepted at WACV 2021 Generation of Human Behavior Workshop.

ESL: Event-based Structured Light

Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch

Code for the paper One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation, CVPR 2021.

UAV-Networks-Routing is a Python simulator for experimenting routing algorithms and mac protocols on unmanned aerial vehicle networks.

Supplementary code for TISMIR paper "Sliding-Window Pitch-Class Histograms as a Means of Modeling Musical Form"

DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

MMFlow is an open source optical flow toolbox based on PyTorch

Official PyTorch code for WACV 2022 paper "CFLOW-AD: Real-Time Unsupervised Anomaly Detection with Localization via Conditional Normalizing Flows"

A simple tutoral for error correction task, based on Pytorch

This provides the R code and data to replicate results in "The USS Trustee’s risky strategy"

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".

The official homepage of the COCO-Stuff dataset.

Tree-based Search Graph for Approximate Nearest Neighbor Search

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".