Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Last update: Aug 12, 2022

Related tags

Deep Learning InterpretableMDE

Overview

InterpretableMDE

A PyTorch implementation for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

arXiv link: https://arxiv.org/abs/2108.05312

Data and Model

For MFF models, we use the dataset they released here, and you can download their models as the baselines here. For BTS models, they use a different set of NYUv2 training images (24,231 instead of 50,688), and you download it here. We put all of our models here.

Evaluation

In this project we use yacs to manage the configurations. To evaluate the performance of a model, for example, the MFF model with SENet backbone using our assigning method, simply run

python eval.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn]

from the root directory.

To evaluate the depth selectivity, run

python dissect.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn] LAYERS D_MFF ON_TRAINING_DATA True

then get the depth selectivity and the dissection result of each unit. Layers' names are seperated by _.

Training

To train a model from scratch, run

python train.py MODEL_NAME MFF_resnet

We currently provide four options for MODEL_NAME, and the training scheme will automatically be switched to align with the original ones when using BTS models.

Acknowledgement

The model part of our code is adapted from Revisiting_Single_Depth_Estimation and bts. Some snippets are adapted from monodepth2.

Bibtex

@inproceedings{you2021iccv,
 title = {Towards Interpretable Deep Networks for Monocular Depth Estimation},
 author = {Zunzhi You and Yi-Hsuan Tsai and Wei-Chen Chiu and Guanbin Li},
 booktitle = {International Conference on Computer Vision (ICCV)},
 year = {2021}
}

Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Related tags

Overview

InterpretableMDE

Data and Model

Evaluation

Training

Acknowledgement

Bibtex

Owner

Zunzhi You

Source code for EquiDock: Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking (ICLR 2022)

This repository contains implementations and illustrative code to accompany DeepMind publications

Spatial color quantization in Rust

This repo includes our code for evaluating and improving transferability in domain generalization (NeurIPS 2021)

[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI 2022)

Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

Official Implementation of HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation

This is the open-source reference implementation of the SIGGRAPH 2021 paper Intersection-free Rigid Body Dynamics.

Code I use to automatically update my videos' metadata on YouTube

A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)

Dogs classification with Deep Metric Learning using some popular losses

MPRNet-Cloud-removal: Progressive cloud removal

Huawei Hackathon 2021 - Sweden (Stockholm)

Generative Flow Networks for Discrete Probabilistic Modeling

This is the repository for paper NEEDLE: Towards Non-invertible Backdoor Attack to Deep Learning Models.

Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019

SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation