(3DV 2021 Oral) Filtering by Cluster Consistency for Large-Scale Multi-Image Matching

Last update: Sep 28, 2022

Overview

Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching (3DV 2021 Oral Presentation)

Filtering by Cluster Consistency (FCC) is an algorithm for filtering out wrong keypoint matches using cycle-consistency constraints. It is fast, accurate and memory efficient. It is based purely on sparse matrix operations and is completely decentralized, so it scales to large matching matrices (millions by millions, such as those in large-scale SfM datasets, e.g. Photo Tourism). It uses a special reweighting scheme, which can be viewed as a message passing procedure, to refine the classification of good and bad keypoint matches. The filtering result is often better than that of spectral and SDP-based methods, and FCC can be several orders of magnitude faster.
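To make the cycle-consistency idea concrete, the sketch below counts, for every match, how many 3-cycles (triangles) of the keypoint matching graph support it. It is only an illustration in Python of the kind of constraint involved, not the FCC statistic or its reweighting scheme; the function name `triangle_support` and the toy graph are made up for this example.

```python
from collections import defaultdict


def triangle_support(edges):
    """Count, for each undirected match (a, b), the keypoints c matched to both a and b."""
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    # A match closed by many triangles agrees with its neighborhood;
    # a match closed by none is suspicious.
    return {(a, b): len(neighbors[a] & neighbors[b]) for a, b in edges}


# Toy graph (hypothetical): keypoints 0, 1, 2 observe one 3D point in three
# images and form a clean cluster; keypoint 3 observes a different point,
# so the match (2, 3) is wrong.
edges = [(0, 1), (0, 2), (1, 2), (2, 3)]
support = triangle_support(edges)
for match, count in sorted(support.items()):
    print(match, count)
```

In matrix form, these counts are the entries of A*A restricted to the nonzero pattern of the adjacency matrix A, which is why computations of this kind need only sparse matrix products.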

If you use our code, please cite the following paper: Yunpeng Shi, Shaohan Li, Tyler Maunu, Gilad Lerman. Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching, International Conference on 3D Vision (3DV), 2021.

Usage

Check out the demo code Demo_FCC.m. A sample output is as follows:

>> Demo_FCC
generate initial camera adjacency matrix
create camera intrinsic matrices. f (focal length) is set to 5000 pixel sizes
generate 3d point cloud (a sphere)
generate camera locations from 3d gaussian dist with radius constraints
generating 2d keypoints from camera projection matrices
generating and corrupting keypoint matches
start running FCC
iteration 1 Completed!
iteration 2 Completed!
iteration 3 Completed!
iteration 4 Completed!
iteration 5 Completed!
iteration 6 Completed!
iteration 7 Completed!
iteration 8 Completed!
iteration 9 Completed!
iteration 10 Completed!
Elapsed time is 0.782890 seconds.
classification error (Jaccard distance) = 0.031733
precision rate = 0.973654
recall rate = 0.994319
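The README does not show how the demo computes the three numbers at the end of this output. The Python sketch below assumes the standard set-based definitions (precision, recall, and Jaccard distance between the set of matches kept and the set of truly good matches); the function names and the toy match sets are hypothetical.

```python
def match_metrics(predicted, truth):
    """Precision, recall and Jaccard distance between two sets of matches."""
    predicted, truth = set(predicted), set(truth)
    tp = len(predicted & truth)  # matches kept that are truly good
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(truth) if truth else 0.0
    union = len(predicted | truth)
    jaccard_distance = 1.0 - tp / union if union else 0.0
    return precision, recall, jaccard_distance


def jaccard_from_precision_recall(precision, recall):
    """Jaccard distance implied by precision p and recall r: 1 - 1 / (1/p + 1/r - 1)."""
    return 1.0 - 1.0 / (1.0 / precision + 1.0 / recall - 1.0)


# Toy example (hypothetical): four truly good matches, four matches kept.
truth = {(0, 1), (0, 2), (1, 2), (3, 4)}
predicted = {(0, 1), (0, 2), (1, 2), (2, 3)}
p, r, d = match_metrics(predicted, truth)
print(p, r, d)

# Consistency check against the precision and recall reported in the sample output.
print(jaccard_from_precision_recall(0.973654, 0.994319))
```

Under these definitions, the precision and recall in the sample output imply a Jaccard distance of about 0.0317, which agrees with the reported classification error up to rounding.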

FCC often gives almost perfect separation between good and bad matches, even when a large fraction of the clean keypoint matches is removed or corrupted. The classification result is often better than that of spectral methods, and is obtained much faster. The following is an example of histograms of our FCC statistics for clean and wrong keypoint matches. Our statistic measures the confidence that a match is clean (good).

Flexible Input and Informative Output

The function FCC.m takes a matching matrix as input: the adjacency matrix of the keypoint matching graph, where the indices of keypoints (nodes) are grouped by image. In principle, the input can also be a similarity matrix of SIFT (or other) features, so it need not be binary. The function outputs a statistics matrix that gives, for each keypoint match, its probability of being a good match. It therefore carries confidence information, not just a classification result. Depending on the task, one can apply different threshold levels to the statistics matrix (trading off precision against recall) to obtain the filtered matches.
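The thresholding step described above can be sketched as follows. This is a minimal Python illustration, not the interface of FCC.m: the statistics are stored as a plain dictionary standing in for a sparse matrix, and the scores, the thresholds and the name `filter_matches` are all hypothetical.

```python
def filter_matches(stats, threshold):
    """Keep the matches whose statistic is at least `threshold`."""
    return {match for match, score in stats.items() if score >= threshold}


# Hypothetical statistics for five candidate matches, stored sparsely as
# {(keypoint_i, keypoint_j): confidence}; higher means more likely good.
stats = {(0, 5): 0.97, (1, 6): 0.91, (2, 7): 0.55, (3, 8): 0.12, (4, 9): 0.03}

lenient = filter_matches(stats, 0.1)  # keeps more matches, favoring recall
strict = filter_matches(stats, 0.9)   # keeps fewer matches, favoring precision
print(sorted(lenient))
print(sorted(strict))
```

Because the output is a confidence score rather than a hard label, the same statistics matrix can be reused with several thresholds without rerunning the filter.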

A novel Synthetic Model

We provide a new synthetic model that realistically mirrors real scenarios and allows control of different parameters. Please check FCC_synthetic_data.m. It generates a set of synthetic cameras, images, 3D points and 2D keypoints. It lets the user control the sparsity of camera correspondences and keypoint matches, as well as the corruption level and corruption mode (elementwise or inlier-outlier model) of the keypoint matches.
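As a rough picture of the geometry involved, the Python sketch below samples 3D points on a sphere and projects them to 2D keypoints with a pinhole camera. It is not a port of FCC_synthetic_data.m: the focal length of 5000 comes from the demo output, while the identity camera rotation, the camera placement, the zero principal point and all function names are simplifying assumptions.

```python
import math
import random


def sample_on_sphere(rng, radius=1.0):
    """Draw a point uniformly on a sphere by normalizing a 3D Gaussian sample."""
    while True:
        v = [rng.gauss(0.0, 1.0) for _ in range(3)]
        norm = math.sqrt(sum(c * c for c in v))
        if norm > 1e-12:  # guard against a (vanishingly unlikely) zero vector
            return [radius * c / norm for c in v]


def project(point, camera_center, focal=5000.0, principal=(0.0, 0.0)):
    """Pinhole projection for a camera with identity rotation looking along +z."""
    x, y, z = (p - c for p, c in zip(point, camera_center))
    if z <= 0:
        return None  # the point is behind the camera
    return (focal * x / z + principal[0], focal * y / z + principal[1])


rng = random.Random(0)
points = [sample_on_sphere(rng) for _ in range(5)]
camera_center = (0.0, 0.0, -10.0)  # assumed placement, in front of the whole sphere
keypoints = [project(p, camera_center) for p in points]
for p, kp in zip(points, keypoints):
    print([round(c, 3) for c in p], kp)
```

Repeating the projection for several cameras gives one keypoint per visible 3D point and image; keypoints that come from the same 3D point are the ground-truth matches that a corruption step would then perturb.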

Owner

Yunpeng Shi
