CAT-Net: Learning Canonical Appearance Transformations

Code to accompany our paper "How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change".

Dependencies

numpy
matpotlib
pytorch + torchvision (1.2)
Pillow
progress (for progress bars in train/val/test loops)
tensorboard + tensorboardX (for visualization)
pyslam + liegroups (optional, for running odometry/localization experiments)
OpenCV (optional, for running odometry/localization experiments)

Training the CAT

Download the ETHL dataset from here or the Virtual KITTI dataset from here
1. ETHL only: rename ethl1/2 to ethl1/2_static.
2. ETHL only: Update the local paths in tools/make_ethl_real_sync.py and run python3 tools/make_ethl_real_sync.py to generate a synchronized copy of the real sequences.
Update the local paths in run_cat_ethl/vkitti.py and run python3 run_cat_ethl/vkitti.py to start training.
In another terminal run tensorboard --port [port] --logdir [path] to start the visualization server, where [port] should be replaced by a numeric value (e.g., 60006) and [path] should be replaced by your local results directory.
Tune in to localhost:[port] and watch the action.

Running the localization experiments

Ensure the pyslam and liegroups packages are installed.
Update the local paths in make_localization_data.py and run python3 make_localization_data.py [dataset] to compile the model outputs into a localization_data directory.
Update the local paths in run_localization_[dataset].py and run python3 run_localization_[dataset].py [rgb,cat] to compute VO and localization results using either the original RGB or CAT-transformed images.
You can compute localization errors against ground truth using the compute_localization_errors.py script, which generates CSV files and several plots. Update the local paths and run python3 compute_localization_errors.py [dataset].

Citation

If you use this code in your research, please cite:

@article{2018_Clement_Learning,
  author = {Lee Clement and Jonathan Kelly},
  journal = {{IEEE} Robotics and Automation Letters},
  link = {https://arxiv.org/abs/1709.03009},
  title = {How to Train a {CAT}: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change},
  year = {2018}
}

Canonical Appearance Transformations

Related tags

Overview

CAT-Net: Learning Canonical Appearance Transformations

Dependencies

Training the CAT

Running the localization experiments

Citation

Owner

STARS Laboratory

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering

Recurrent Conditional Query Learning

An air quality monitoring service with a Raspberry Pi and a SDS011 sensor.

Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging

Studying Python release adoptions by looking at PyPI downloads

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets"

中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。

Pytorch implementation of Generative Models as Distributions of Functions 🌿

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

Learning from Synthetic Shadows for Shadow Detection and Removal [Inoue+, IEEE TCSVT 2020].

Python with OpenCV - MediaPip Framework Hand Detection

Code To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment.

Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

RNN Predict Street Commercial Vitality

GoodNews Everyone! Context driven entity aware captioning for news images

Predicting lncRNA–protein interactions based on graph autoencoders and collaborative training