Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Last update: Jan 03, 2023

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

This is a lightweight python script that fuses multiple registered color and depth images into a projective truncated signed distance function (TSDF) volume, which can then be used to create high quality 3D surface meshes and point clouds. Tested on Ubuntu 16.04.

An older CUDA/C++ version can be found here.

Requirements

Python 2.7+ with NumPy, PyCUDA, OpenCV, Scikit-image and Numba. These can be quickly installed/updated by running the following:
```
pip install --user numpy opencv-python scikit-image numba
```
[Optional] GPU acceleration requires an NVIDA GPU with CUDA and PyCUDA:
```
pip install --user pycuda
```

Demo

This demo fuses 1000 RGB-D images from the 7-scenes dataset into a 405 x 264 x 289 projective TSDF voxel volume with 2cm resolution at about 30 FPS in GPU mode (0.4 FPS in CPU mode), and outputs a 3D mesh mesh.ply which can be visualized with a 3D viewer like Meshlab.

Note: color images are saved as 24-bit PNG RGB, depth images are saved as 16-bit PNG in millimeters.

python demo.py

Seen In

References

Citing

This repository is a part of 3DMatch Toolbox. If you find this code useful in your work, please consider citing:

@inproceedings{zeng20163dmatch,
    title={3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions},
    author={Zeng, Andy and Song, Shuran and Nie{\ss}ner, Matthias and Fisher, Matthew and Xiao, Jianxiong and Funkhouser, Thomas},
    booktitle={CVPR},
    year={2017}
}

Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Related tags

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

Requirements

Demo

Seen In

References

Citing

Owner

Andy Zeng

PyTorch implementation of residual gated graph ConvNets, ICLR’18

Spearmint Bayesian optimization codebase

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

This is a repo of basic Machine Learning!

Python Fanduel API (2021) - Lineup Automation

Deep Text Search is an AI-powered multilingual text search and recommendation engine with state-of-the-art transformer-based multilingual text embedding (50+ languages).

Deep learning with dynamic computation graphs in TensorFlow

SSL_SLAM2: Lightweight 3-D Localization and Mapping for Solid-State LiDAR (mapping and localization separated) ICRA 2021

Extreme Lightwegith Portrait Segmentation

Code for NeurIPS 2021 paper: Invariant Causal Imitation Learning for Generalizable Policies

Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data.

Official Python implementation of the 'Sparse deconvolution'-v0.3.0

Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561

True Few-Shot Learning with Language Models

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals, CVPR2021

unofficial pytorch implement of "Squareplus: A Softplus-Like Algebraic Rectifier"

Simple keras FCN Encoder/Decoder model for MS-COCO (food subset) segmentation

Official implementation of the article "Unsupervised JPEG Domain Adaptation For Practical Digital Forensics"

This package contains deep learning models and related scripts for RoseTTAFold

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021