Instance-level Image Retrieval using Reranking Transformers

Fuwen Tan, Jiangbo Yuan, Vicente Ordonez, ICCV 2021.

Abstract

Instance-level image retrieval is the task of searching in a large database for images that match an object in a query image. To address this task, systems usually rely on a retrieval step that uses global image descriptors, and a subsequent step that performs domain-specific refinements or reranking by leveraging operations such as geometric verification based on local features. In this work, we propose Reranking Transformers (RRTs) as a general model to incorporate both local and global features to rerank the matching images in a supervised fashion and thus replace the relatively expensive process of geometric verification. RRTs are lightweight and can be easily parallelized so that reranking a set of top matching results can be performed in a single forward-pass. We perform extensive experiments on the Revisited Oxford and Paris datasets, and the Google Landmark v2 dataset, showing that RRTs outperform previous reranking approaches while using much fewer local descriptors. Moreover, we demonstrate that, unlike existing approaches, RRTs can be optimized jointly with the feature extractor, which can lead to feature representations tailored to downstream tasks and further accuracy improvements.

Software required

The code is only tested on Linux 64:

  conda create -n rrt python=3.6
  conda activate rrt
  pip install -r requirements.txt

Organization

To use the code for experiments on Google Landmarks v2, Revisited Oxford/Paris, please refer to the folder RRT_GLD.

To use the code for experiments on Stanford Online Products, please refer to the folder RRT_SOP.

To use the code for evaluating SuperGlue on Revisited Oxford/Paris and Stanford Online Products, please refer to the repo SuperGlue.

Citing

If you find our paper/code useful, please consider citing:

@inproceedings{fwtan-instance-2021,
    author = {Fuwen Tan and Jiangbo Yuan and Vicente Ordonez},
    title = {Instance-level Image Retrieval using Reranking Transformers},
    year = {2021},
    booktitle = {International Conference on Computer Vision (ICCV)}
 }

[ICCV 2021] Instance-level Image Retrieval using Reranking Transformers

Related tags

Overview

Instance-level Image Retrieval using Reranking Transformers

Abstract

Software required

Organization

Citing

Owner

UVA Computer Vision

justCTF [*] 2020 challenges sources

Chinese NER with albert/electra or other bert descendable model (keras)

The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques

DensePhrases provides answers to your natural language questions from the entire Wikipedia in real-time

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Stack based programming language that compiles to x86_64 assembly or can alternatively be interpreted in Python

Various Algorithms for Short Text Mining

Sequence Modeling with Structured State Spaces

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Open solution to the Toxic Comment Classification Challenge

A fast, efficient universal vector embedding utility package.

Words_And_Phrases - Just a repo for useful words and phrases that might come handy in some scenarios. Feel free to add yours

Lyrics generation with GPT2-based Transformer

End-to-end text to speech system using gruut and onnx. There are 40 voices available across 8 languages.

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Translation for Trilium Notes. Trilium Notes 中文版.

OpenAI CLIP text encoders for multiple languages!

The Easy-to-use Dialogue Response Selection Toolkit for Researchers

:hot_pepper: R²SQL: "Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing." (AAAI 2021)

2021 AI CUP Competition on Traditional Chinese Scene Text Recognition - Intermediate Contest