Rotation Robust Descriptors

Last update: Nov 15, 2022

Overview

RoRD

Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

Project Page | Paper link

Evaluation and Datasets

MMA : Training on PhotoTourism and testing on HPatches and proposed Rotated HPatches
Pose Estimation : Training on same PhotoTourism datasets as used for MMA and testing on proposed DiverseView
Visual Place Recognition : Oxford RobotCar training sequence and testing sequence

Pretrained Models

Download models from Google Drive (73.9 MB) in the base directory.

Evaluating RoRD

You can evaluate RoRD on demo images or replace it with your custom images.

Dependencies can be installed in a conda of virtualenv by running:
1. pip install -r requirements.txt
python extractMatch.py <rgb_image1> <rgb_image2> --model_file <path to the model file RoRD>
Example:
python extractMatch.py demo/rgb/rgb1_1.jpg demo/rgb/rgb1_2.jpg --model_file models/rord.pth
This should give you output like this:

RoRD

SIFT

DiverseView Dataset

Download dataset from Google Drive (97.8 MB) in the base directory (only needed if you want to evaluate on DiverseView Dataset).

Evaluation on DiverseView Dataset

The DiverseView Dataset is a custom dataset consisting of 4 scenes with images having high-angle camera rotations and viewpoint changes.

Pose estimation on single image pair of DiverseView dataset:
1. cd demo
2. python register.py --rgb1 <path to rgb image 1> --rgb2 <path to rgb image 2> --depth1 <path to depth image 1> --depth2 <path to depth image 2> --model_rord <path to the model file RoRD>
3. Example:
  python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth
4. This should give you output like this:

RoRD matches in perspective view

RoRD matches in orthographic view

To visualize the registered point cloud, use --viz3d command:
1. python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth --viz3d

PointCloud registration using correspondences

Pose estimation on a sequence of DiverseView dataset:
1. cd evaluation/DiverseView/
2. python evalRT.py --dataset <path to DiverseView dataset> --sequence <sequence name> --model_rord <path to RoRD model> --output_dir <name of output dir>
3. Example:
  1. python evalRT.py --dataset /path/to/preprocessed/ --sequence data1 --model_rord ../../models/rord.pth --output_dir out
4. This would generate out folder containing predicted transformations and matching results in out/vis folder, containing images like below:

RoRD

Training RoRD on PhotoTourism Images

Training using rotation homographies with initialization from D2Net weights (Download base models as mentioned in Pretrained Models).
Download branderburg_gate dataset that is used in the configs/train_scenes_small.txt from here(5.3 Gb) in phototourism folder.

Folder stucture should be:

phototourism/  
___ brandenburg_gate  
___ ___ dense  
___ ___	___ images  
___ ___	___ stereo  
___ ___	___ sparse

python trainPT_ipr.py --dataset_path <path_to_phototourism_folder> --init_model models/d2net.pth --plot

TO-DO

Provide VPR code
Provide combine training of RoRD + D2Net
Provide code for calculating error in Diverseview Dataset

Credits

Our base model is borrowed from D2-Net.

BibTex

If you use this code in your project, please cite the following paper:

@misc{rord2021,
      title={RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching}, 
      author={Udit Singh Parihar and Aniket Gujarathi and Kinal Mehta and Satyajit Tourani and Sourav Garg and Michael Milford and K. Madhava Krishna},
      year={2021},
      eprint={2103.08573},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Rotation Robust Descriptors

Related tags

Overview

RoRD

Evaluation and Datasets

Pretrained Models

Evaluating RoRD

RoRD

SIFT

DiverseView Dataset

Evaluation on DiverseView Dataset

RoRD matches in perspective view

RoRD matches in orthographic view

PointCloud registration using correspondences

RoRD

Training RoRD on PhotoTourism Images

TO-DO

Credits

BibTex

Owner

Udit Singh Parihar

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

Referring Video Object Segmentation

Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via Self-Supervised Multi-Task Learning. Code will be available soon.

This repository contains the source code of our work on designing efficient CNNs for computer vision

Fuzzing the Kernel Using Unicornafl and AFL++

Automatically measure the facial Width-To-Height ratio and get facial analysis results provided by Microsoft Azure

PyTorch implementation of "Optimization Planning for 3D ConvNets"

A curated list of awesome game datasets, and tools to artificial intelligence in games

Extract MNIST handwritten digits dataset binary file into bmp images

The repository contain code for building compiler using puthon.

mmfewshot is an open source few shot learning toolbox based on PyTorch

Happywhale - Whale and Dolphin Identification Silver🥈 Solution (26/1588)

Split Variational AutoEncoder

Synthetic Scene Text from 3D Engines

fastgradio is a python library to quickly build and share gradio interfaces of your trained fastai models.

Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

The official codes of "Semi-supervised Models are Strong Unsupervised Domain Adaptation Learners".