Learning Camera Localization via Dense Scene Matching, CVPR2021

Last update: Dec 01, 2022

Overview

This repository contains the code for our CVPR 2021 paper, "Learning Camera Localization via Dense Scene Matching", by Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu and Ping Tan.

The paper presents a new method for scene-agnostic camera localization using dense scene matching (DSM), in which a cost volume is constructed between a query image and a scene. The cost volume and the corresponding scene coordinates are processed by a CNN to predict dense coordinates for the query image. The camera pose can then be solved with a PnP algorithm.
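
To make the cost-volume idea concrete, here is a minimal toy sketch, not the code in this repository. It assumes per-pixel query features and per-point scene features are already available, uses cosine similarity as the matching cost, and replaces the CNN with a simple softmax-weighted average of scene coordinates. All function names, the `temperature` parameter, and the toy data are hypothetical, and plain Python lists are used only so the example runs without dependencies.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def build_cost_volume(query_feats, scene_feats):
    """Return cost[i][j]: similarity of query pixel i to scene point j."""
    return [[cosine_similarity(q, s) for s in scene_feats] for q in query_feats]


def soft_argmax_coords(cost_row, scene_coords, temperature=0.05):
    """Softmax-weighted average of scene coordinates for one query pixel.

    Assumption: this is a hand-written stand-in for the learned CNN,
    not the network described in the paper.
    """
    peak = max(cost_row)  # subtract the max for numerical stability
    weights = [math.exp((c - peak) / temperature) for c in cost_row]
    total = sum(weights)
    return tuple(
        sum(w * xyz[k] for w, xyz in zip(weights, scene_coords)) / total
        for k in range(3)
    )


# Toy scene: three points with 2-D features and known 3-D coordinates.
scene_feats = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
scene_coords = [(0.0, 0.0, 1.0), (2.0, 0.0, 1.0), (0.0, 3.0, 2.0)]

# Toy query image: two pixels whose features resemble scene points 0 and 1.
query_feats = [[0.9, 0.1], [0.05, 1.0]]

cost = build_cost_volume(query_feats, scene_feats)
pred_coords = [soft_argmax_coords(row, scene_coords) for row in cost]

for pixel_index, xyz in enumerate(pred_coords):
    print(pixel_index, [round(v, 3) for v in xyz])
```

Each query pixel ends up paired with a predicted 3D coordinate; 2D-3D correspondences of this kind are what a PnP solver takes as input to recover the camera pose.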

If you find this project useful, please cite:

@inproceedings{Tang2021Learning,
  title={Learning Camera Localization via Dense Scene Matching},
  author={Shitao Tang and Chengzhou Tang and Rui Huang and Siyu Zhu and Ping Tan},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Usage

Environment

The code has been tested with:
- pytorch=1.4.0
- lmdb (optional)
- yaml
- skimage
- opencv
- numpy=1.17
- tensorboard
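
As a quick sanity check before building, a small script along these lines can report which of the packages above are importable. This is a sketch and not part of the repository; it assumes the usual import names (`torch` for pytorch, `cv2` for opencv, `yaml` for PyYAML, `skimage` for scikit-image) and does not check versions. The `missing_modules` helper is hypothetical.

```python
import importlib.util

# Import names for the packages listed above (assumed mapping:
# pytorch -> torch, opencv -> cv2, yaml -> PyYAML, skimage -> scikit-image).
REQUIRED = ["torch", "yaml", "skimage", "cv2", "numpy", "tensorboard"]
OPTIONAL = ["lmdb"]


def missing_modules(names):
    """Return the module names from `names` that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing_required = missing_modules(REQUIRED)
missing_optional = missing_modules(OPTIONAL)

if missing_required:
    print("Missing required packages:", ", ".join(missing_required))
else:
    print("All required packages found.")

if missing_optional:
    print("Missing optional packages:", ", ".join(missing_optional))
```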

Installation

Build PyTorch operations

  cd libs/model/ops
  python setup.py install

Build PnP algorithm

  cd libs/utils/lm_pnp
  mkdir build
  cd build
  cmake ..
  make all

Train and Test

Download

You can download the trained models and label files for 7scenes, Cambridge, and ScanNet.

For 7scenes, you can use the prepared data for each scene below.

Chess Fire Heads Office Pumpkin Kitchen Stairs

For Cambridge landmarks, you can download the image files here, and the depth maps here.

Test

Please refer to configs/7scenes.yaml for a detailed explanation of how to set the label file path and the image file path.
- 7scenes
```
python tools/video_test.py --config configs/7scenes.yaml
```
- Cambridge
```
python tools/video_test.py --config configs/cambridge.yaml
```

Train

We use a pretrained ResNet-FPN model.
```
python tools/train_net.py
```

Owner

tangshitao
