The official code for the ICRA 2021 paper "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Last update: Jul 27, 2022

Overview

G2S

This is the official code for the ICRA 2021 paper "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation" by Hemang Chawla, Arnav Varma, Elahe Arani and Bahram Zonooz.

The G2S (GPS-to-Scale) loss is a dynamically weighted loss that can be added to the appearance-based losses when training any monocular self-supervised depth estimation architecture, so that the model produces scale-consistent and scale-aware depth estimates at inference.

Here, we provide a helper GPS dataloader class and the G2S loss class for using this loss with any model.

For details, please see the Paper and Presentation.

KITTI GPS

The GPS files, which contain the geodesic GPS information of the raw KITTI dataset in local coordinates for training with the G2S loss, can be found in the assets folder as kitti_gps_raw.zip.
Unzip the file at /path/to/KITTI/raw_data/sync to merge the GPS files into the expected directory tree structure.
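
If you prefer to do the extraction from Python rather than from a shell, the step above can be sketched as follows. This is a minimal sketch using only the standard library; the function name `extract_gps_archive` is hypothetical and not part of this repository, and it assumes the archive's internal layout already mirrors the sync directory tree, as described above.

```python
import zipfile
from pathlib import Path


def extract_gps_archive(archive_path, sync_root):
    """Extract the GPS archive into the KITTI sync directory.

    Hypothetical helper (not part of this repository). Existing folders
    are kept, so the GPS files are merged into the current tree.
    Returns the sorted list of entries contained in the archive.
    """
    sync_root = Path(sync_root)
    sync_root.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(sync_root)
        return sorted(archive.namelist())
```

A call would look like `extract_gps_archive("assets/kitti_gps_raw.zip", "/path/to/KITTI/raw_data/sync")`, with both paths adjusted to your setup.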

Usage

You can use the G2S class in lossG2S.py within your project to obtain scale-consistent and scale-aware predictions. This requires using the co-present GPS modality along with the images. To load the GPS data, please integrate the GPSDataloader class from dataloaderGPS.py into your image dataloader.
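
To illustrate the general idea of adding a dynamically weighted GPS term to an appearance-based loss, here is a minimal, framework-free sketch. It is not the implementation in lossG2S.py: the function names, the squared-difference penalty, and the linear weight schedule are all assumptions chosen for illustration. Please refer to the paper and to lossG2S.py for the actual formulation and weighting.

```python
import math


def gps_scale_penalty(pred_translations, gps_target, gps_sources):
    """Mean squared gap between predicted and GPS translation magnitudes.

    Assumed form, for illustration only.
    pred_translations: predicted target-to-source translations, one (x, y, z) per source frame.
    gps_target: GPS position of the target frame in local coordinates.
    gps_sources: GPS positions of the source frames in the same local coordinates.
    """
    total = 0.0
    for t_pred, gps_src in zip(pred_translations, gps_sources):
        pred_norm = math.sqrt(sum(c * c for c in t_pred))
        gps_norm = math.dist(gps_target, gps_src)
        total += (pred_norm - gps_norm) ** 2
    return total / len(pred_translations)


def dynamic_weight(epoch, num_epochs, max_weight=1.0):
    """Hypothetical schedule: ramps linearly from 0 to max_weight over training."""
    return max_weight * epoch / max(num_epochs - 1, 1)


def total_loss(appearance_loss, scale_penalty, epoch, num_epochs):
    """Appearance-based loss plus the dynamically weighted GPS term."""
    return appearance_loss + dynamic_weight(epoch, num_epochs) * scale_penalty
```

In this sketch the GPS term contributes nothing at the first epoch and its influence grows as training proceeds. In a real training loop the same structure would operate on batched tensors from your pose network and from the GPS dataloader.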

Cite Our Work

If you find the code useful in your research, please consider citing our paper:

@inproceedings{chawlavarma2021multimodal,
	author={H. {Chawla} and A. {Varma} and E. {Arani} and B. {Zonooz}},
	booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
	title={Multimodal Scale Consistency and Awareness for Monocular Self-Supervised
	Depth Estimation},
	location={Xi’an, China},
	publisher={IEEE (in press)},
	year={2021}
}

License

This project is licensed under the terms of the MIT license.

Owner

NeurAI
