Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Last update: Dec 13, 2022

Related tags

Deep Learning DistDepth

Overview

Toward Practical Monocular Indoor Depth Estimation

Cho-Ying Wu, Jialiang Wang, Michael Hall, Ulrich Neumann, Shuochen Su

[arXiv] [project site]

DistDepth

Our DistDepth is a highly robust monocular depth estimation approach for generic indoor scenes.

Trained with stereo sequences without their groundtruth depth
Structured and metric-accurate
Run in an interactive rate with Laptop GPU
Sim-to-real: trained on simulation and becomes transferrable to real scenes

Single Image Inference Demo

We test on Ubuntu 20.04 LTS with an laptop NVIDIA 2080 GPU (only GPU mode is supported).

Install packages

Use conda

conda create --name distdepth python=3.8 conda activate distdepth
Install pre-requisite common packages. Go to https://pytorch.org/get-started/locally/ and install pytorch that is compatible to your computer. We test on pytorch v1.9.0 and cudatoolkit-11.1. (The codes should work under other v1.0+ versions)

conda install pytorch==1.9.0 torchvision==0.10.0 torchaudio==0.9.0 cudatoolkit=11.3 -c pytorch -c conda-forge
Install other dependencies: opencv-python and matplotlib.

pip install opencv-python, matplotlib

Download pretrained models

Download pretrained models [here] (ResNet152, 246MB).
Move the downloaded item under this folder, and then unzip it. You should be able to see a new folder 'ckpts' that contains the pretrained models.
Run

python demo.py
Results will be stored under results/

Data

Download SimSIN [here]. For UniSIN and VA, please download at the [project site].

Depth-aware AR effects

Virtual object insertion:

Dragging objects along a trajectory:

Citation

@inproceedings{wu2022toward,
title={Toward Practical Monocular Indoor Depth Estimation},
author={Wu, Cho-Ying and Wang, Jialiang and Hall, Michael and Neumann, Ulrich and Su, Shuochen},
booktitle={CVPR},
year={2022}
}

License

DistDepth is CC-BY-NC licensed, as found in the LICENSE file.

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

Related tags

Overview

Toward Practical Monocular Indoor Depth Estimation

DistDepth

Single Image Inference Demo

Data

Depth-aware AR effects

Citation

License

Owner

Meta Research

PSANet: Point-wise Spatial Attention Network for Scene Parsing, ECCV2018.

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Codebase of deep learning models for inferring stability of mRNA molecules

RetinaNet-PyTorch - A RetinaNet Pytorch Implementation on remote sensing images and has the similar mAP result with RetinaNet in MMdetection

Siamese-nn-semantic-text-similarity - A repository containing comprehensive Neural Networks based PyTorch implementations for the semantic text similarity task

Conditional Gradients For The Approximately Vanishing Ideal

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Official PyTorch implementation of "BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation" (NeurIPS 2021)

The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.

Open-AI's DALL-E for large scale training in mesh-tensorflow.

InsCLR: Improving Instance Retrieval with Self-Supervision

GAN example for Keras. Cuz MNIST is too small and there should be something more realistic.

Neighbor2Seq: Deep Learning on Massive Graphs by Transforming Neighbors to Sequences

A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Generating Videos with Scene Dynamics

MQBench Quantization Aware Training with PyTorch

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning

Gin provides a lightweight configuration framework for Python