[ICRA2021] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignment

Last update: Dec 28, 2022

Overview

Interactive Scene Reconstruction

Project Page | Paper

This repository contains the implementation of our ICRA2021 paper Reconstructing Interactive 3D Scenes by Panoptic Mapping and CAD Model Alignments. The proposed pipeline reconstructs an interactive indoor scene from RGBD streams, where objects are replaced by (articulated) CAD models. Represented as a contact graph, the reconstructed scene naturally encodes actionable information in terms of environmental kinematics, and can be imported into various simulators to support robot interactions.

The pipeline consists of 3 modules:

A robust panoptic mapping module that accurately reconstructs the semantics and geometry of objects and layouts. It is a modified version of Voxblox++ with improved robustness. The 2D image segmentation is obtained using [Detectron2](https://github.com/facebookresearch/detectron2).
An object-based reasoning module that constructs a contact graph from the dense panoptic map and replaces objects with aligned CAD models
An interface that converts a contact graph into a kinematic tree in the URDF format, which can be imported into ROS-based simulators
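To make the third module more concrete, here is a minimal sketch of how a contact graph could be serialized as a URDF kinematic tree. It is not the repository's implementation: the `CONTACT_GRAPH` layout, the object names, the joint offsets, and the function `contact_graph_to_urdf` are all hypothetical and chosen only for illustration. It uses the Python standard library only.

```python
import xml.etree.ElementTree as ET

# Hypothetical contact graph: child -> (parent, joint type, xyz offset in the parent frame).
# Names and numbers are illustrative, not taken from the paper or the code base.
CONTACT_GRAPH = {
    "table": ("room", "fixed", (1.0, 0.5, 0.0)),
    "cabinet": ("room", "fixed", (2.0, 0.0, 0.0)),
    "cabinet_door": ("cabinet", "revolute", (0.3, 0.0, 0.4)),
    "mug": ("table", "fixed", (0.1, 0.0, 0.75)),
}


def contact_graph_to_urdf(graph, root="room", robot_name="scene"):
    """Serialize a child -> (parent, joint_type, xyz) mapping as a URDF string."""
    robot = ET.Element("robot", name=robot_name)

    # One <link> per node; the root link has no parent joint.
    for name in [root, *graph]:
        ET.SubElement(robot, "link", name=name)

    # One <joint> per supporting (parent -> child) edge.
    for child, (parent, joint_type, xyz) in graph.items():
        joint = ET.SubElement(
            robot, "joint", name=f"{parent}_to_{child}", type=joint_type
        )
        ET.SubElement(joint, "parent", link=parent)
        ET.SubElement(joint, "child", link=child)
        ET.SubElement(
            joint, "origin", xyz=" ".join(str(v) for v in xyz), rpy="0 0 0"
        )
        if joint_type == "revolute":
            # Revolute joints need limits in URDF; these values are placeholders.
            ET.SubElement(joint, "axis", xyz="0 0 1")
            ET.SubElement(
                joint, "limit", lower="0", upper="1.57", effort="10", velocity="1"
            )

    return ET.tostring(robot, encoding="unicode")


if __name__ == "__main__":
    print(contact_graph_to_urdf(CONTACT_GRAPH))
```

In this sketch, supporting relations become `fixed` joints and an articulated part becomes a `revolute` joint, which is one plausible way to express environmental kinematics in a format that ROS-based simulators can load. Link geometry (meshes of the aligned CAD models) is omitted for brevity.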

Todo

Upload code for panoptic mapping
Upload submodules for panoptic mapping
Upload code for CAD replacement
Upload code for URDF conversion and scene visualization
Upload dataset and use cases
Update instructions

1. Installation

1.1 Prerequisites

Ubuntu 16.04 (with ROS Kinetic) or 18.04 (with ROS Melodic)
Python >= 3.7
gcc & g++ >= 5.4
3 <= OpenCV < 4
(Optional) Nvidia GPU (with a compatible CUDA toolkit and cuDNN) if you want to run online segmentation
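The version requirements above can be checked before building. The script below is a rough sketch and not part of the repository: the helper names `parse_version`, `tool_version`, and `check_prerequisites` are hypothetical. It assumes `gcc` and `g++` print their version in response to `--version`, and it only inspects the OpenCV Python bindings (`cv2`), which may differ from the OpenCV libraries used by the C++ build.

```python
import re
import shutil
import subprocess
import sys


def parse_version(text):
    """Return the first dotted version number in text as a tuple of ints, or None."""
    match = re.search(r"(\d+(?:\.\d+)+)", text)
    return tuple(int(p) for p in match.group(1).split(".")) if match else None


def tool_version(command):
    """Run `<command> --version` and parse the output; None if the tool is missing."""
    if shutil.which(command) is None:
        return None
    out = subprocess.run(
        [command, "--version"], capture_output=True, text=True
    ).stdout
    return parse_version(out)


def check_prerequisites():
    """Map each requirement to True/False, or None when it could not be checked."""
    results = {"Python >= 3.7": sys.version_info[:2] >= (3, 7)}

    for tool in ("gcc", "g++"):
        version = tool_version(tool)
        results[f"{tool} >= 5.4"] = None if version is None else version >= (5, 4)

    try:
        import cv2  # Python bindings only; the C++ libraries are not inspected.

        major = int(cv2.__version__.split(".")[0])
        results["3 <= OpenCV < 4"] = 3 <= major < 4
    except ImportError:
        results["3 <= OpenCV < 4"] = None

    return results


if __name__ == "__main__":
    for requirement, ok in check_prerequisites().items():
        status = {True: "ok", False: "FAILED", None: "not found"}[ok]
        print(f"{requirement}: {status}")
```

A result of "not found" only means the script could not locate the tool or module; it does not prove the dependency is absent from the system.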

1.2 Clone the repository & install catkin dependencies

First create and navigate to your catkin workspace

cd <your-working-directory>
mkdir -p <your-ros-ws>/src && cd <your-ros-ws>

Then initialize and configure the workspace. Remember to replace <your-ros-version> with your ROS distribution (kinetic or melodic).

catkin init
catkin config --extend /opt/ros/<your-ros-version> --merge-devel 
catkin config --cmake-args -DCMAKE_CXX_STANDARD=14 -DCMAKE_BUILD_TYPE=Release

Clone this repository, including its submodules, into the src/ folder of your ROS workspace:

cd src
git clone --recursive https://github.com/hmz-15/Interactive-Scene-Reconstruction.git

Then add dependencies specified by .rosinstall using wstool

cd Interactive-Scene-Reconstruction
wstool init dependencies
cd dependencies
wstool merge -t . ../mapping/voxblox-plusplus/voxblox-plusplus_https.rosinstall
wstool merge -t . ../mapping/orb_slam2_ros/orb_slam2_ros_https.rosinstall
wstool update

1.3 Build packages

cd <your-ros-ws>
catkin build orb_slam2_ros perception_ros gsm_node -j2
