[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Last update: Jan 04, 2023

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Lukas Koestler^1* Nan Yang^1,2*,† Niclas Zeller^2,3 Daniel Cremers^1,2

^*equal contribution ^†corresponding author

¹Technical University of Munich ²Artisense
³Karlsruhe University of Applied Sciences

Conference on Robot Learning (CoRL) 2021, London, UK

3DV 2021 Best Demo Award

arXiv | Video | OpenReview | Project Page

Code and Data

📣 CVA-MVSNet released! Please check cva_mvsnet/.
📣 Replica training data released! Please check replica/.
C++ code realse before Christmas. Thank you for your patience!

Abstract

In this paper, we present TANDEM a real-time monocular tracking and dense mapping framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth predictions. To predict the dense depth maps, we propose Cascade View-Aggregation MVSNet (CVA-MVSNet) that utilizes the entire active keyframe window by hierarchically constructing 3D cost volumes with adaptive view aggregation to balance the different stereo baselines between the keyframes. Finally, the predicted depth maps are fused into a consistent global map represented as a truncated signed distance function (TSDF) voxel grid. Our experimental results show that TANDEM outperforms other state-of-the-art traditional and learning-based monocular visual odometry (VO) methods in terms of camera tracking. Moreover, TANDEM shows state-of-the-art real-time 3D reconstruction performance.

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

Tensorflow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432)

MRI reconstruction (e.g., QSM) using deep learning methods

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks (Scientific Reports)

Epidemiology analysis package

Data visualization app for H&M competition in kaggle

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.

RSC-Net: 3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

A Python library for generating new text from existing samples.

Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

style mixing for animation face

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

Zeyuan Chen, Yangchao Wang, Yang Yang and Dong Liu.

Code of paper "CDFI: Compression-Driven Network Design for Frame Interpolation", CVPR 2021

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

Code for database and frontend of webpage for Neural Fields in Visual Computing and Beyond.

An implementation of Deep Forest 2021.2.1.

Official pytorch implementation of "Scaling-up Disentanglement for Image Translation", ICCV 2021.

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Applying curriculum to meta-learning for few shot classification

BED: A Real-Time Object Detection System for Edge Devices

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mappingin Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

Tensorflow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432)

MRI reconstruction (e.g., QSM) using deep learning methods

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks (Scientific Reports)

Epidemiology analysis package

Data visualization app for H&M competition in kaggle

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.

RSC-Net: 3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

A Python library for generating new text from existing samples.

Implementation of the "PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences" paper.

style mixing for animation face

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

Zeyuan Chen, Yangchao Wang, Yang Yang and Dong Liu.

Code of paper "CDFI: Compression-Driven Network Design for Frame Interpolation", CVPR 2021

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

Code for database and frontend of webpage for Neural Fields in Visual Computing and Beyond.

An implementation of Deep Forest 2021.2.1.

Official pytorch implementation of "Scaling-up Disentanglement for Image Translation", ICCV 2021.

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Applying curriculum to meta-learning for few shot classification

BED: A Real-Time Object Detection System for Edge Devices

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo