Pathdreamer: A World Model for Indoor Navigation

Last update: Jan 04, 2023

Related tags

Deep Learning pathdreamer

Overview

Pathdreamer: A World Model for Indoor Navigation

This repository hosts the open source code for Pathdreamer, to be presented at ICCV 2021.

Paper | Project Webpage | Colab Demo

Setup instructions

Environment

Set up virtualenv, and install required libraries:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Add the Pathdreamer library to PYTHONPATH:

export PYTHONPATH=$PYTHONPATH:/home/path/to/pathdreamer_root/

Downloading Pretrained Checkpoints

We provide a pretrained checkpoint which can be acquired by running:

wget https://storage.googleapis.com/gresearch/pathdreamer/ckpt.tar -P data/
tar -xf data/ckpt.tar --directory data/

The results will be extracted to the data/ckpt directory. Two checkpoints are provided, one for the Stage 1 model (Structure Generator), and another for the Stage 2 model (Image Generator).

Colab Demo

Pathdreamer_Example_Colab.ipynb [click to launch in Google Colab] shows how to setup and run the pretrained Pathdreamer model for inference. It includes examples on synthesizing image sequences and continuous video sequences for arbitrary navigation trajectories.

Citation

If you find this work useful, please consider citing:

@inproceedings{koh2021pathdreamer,
  title={Pathdreamer: A World Model for Indoor Navigation},
  author={Koh, Jing Yu and Lee, Honglak and Yang, Yinfei and Baldridge, Jason and Anderson, Peter},
  journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2021}
}

License

Pathdreamer is released under the Apache 2.0 license. The Matterport3D dataset is governed by the Matterport3D Terms of Use.

Disclaimer

Not an official Google product.

Pathdreamer: A World Model for Indoor Navigation

Related tags

Overview

Pathdreamer: A World Model for Indoor Navigation

Setup instructions

Environment

Downloading Pretrained Checkpoints

Colab Demo

Citation

License

Disclaimer

Owner

Google Research

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

Learning kernels to maximize the power of MMD tests

HomoInterpGAN - Homomorphic Latent Space Interpolation for Unpaired Image-to-image Translation

PyTorch implementation of GLOM

This is a collection of our NAS and Vision Transformer work.

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

This repo holds the code of TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation

A fast model to compute optical flow between two input images.

A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.

MiraiML: asynchronous, autonomous and continuous Machine Learning in Python

Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021)

A semantic segmentation toolbox based on PyTorch

Official PyTorch implementation of "RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on" (IJCAI-ECAI 2022)

Implementation of [Time in a Box: Advancing Knowledge Graph Completion with Temporal Scopes].

Python Blood Vessel Topology Analysis

AntroPy: entropy and complexity of (EEG) time-series in Python

Fast Learning of MNL Model From General Partial Rankings with Application to Network Formation Modeling

Hybrid Neural Fusion for Full-frame Video Stabilization

Mosaic of Object-centric Images as Scene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

particle tracking model, works with the ROMS output file(qck.nc, his.nc)