Code and data of the Fine-Grained R2R Dataset proposed in paper Sub-Instruction Aware Vision-and-Language Navigation

Last update: Nov 15, 2022

Overview

Fine-Grained R2R

Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP2020 paper Sub-Instruction Aware Vision-and-Language Navigation.

Code of the navigator will be released soon.

This dataset enriches the benchmark Room-to-Room (R2R) dataset by dividing the instructions into sub-instructions and pairing each of those with their corresponding viewpoints in the path.

The copyright resides with the authors of the paper Sub-Instruction Aware Vision-and-Language Navigation.
This dataset is build upon the Room-to-Room (R2R) dataset, we refer the readers to its repository for more details.

Data

The Fine-Grained R2R data, which enriches the R2R dataset with sub-instructions and their corresponding paths. The overall instruction and trajectory of each sample remains the same.

For paths in the train, the validation seen and the validation unseen splits, we add two new entries:
- new_instructions: A list of sub-instructions produced by the Chunking Function from the complete instructions. You can use import ast and ast.literal_eval() to read it a list.
- chunk_view: A list of sub-paths corresponding to the sub-instructions, where each number in the list is an index of a viewpoint in the ground-truth path. The index starts at 1.
Some sub-instructions which refer to camera rotation or a STOP action could match to a single viewpoint.
For the test unseen split, we only provide the sub-instructions but not the sub-paths.

Source

The code of the proposed Chunking Function for generating sub-instructions.

Install the StanfordNLP package (v0.1.2 in our experiment) and download the English models for the neural pipeline.
Run make_subinstr.py to generate data with sub-instructions from the original R2R data.
The generated files had been sent to the Amazon Mechanical Turk (AMT) for annotating the sub-paths.

Reference

If you use or dicsuss the Fine-Grained R2R dataset in your work, please cite our paper:

@article{hong2020sub,
  title={Sub-Instruction Aware Vision-and-Language Navigation},
  author={Hong, Yicong and Rodriguez-Opazo, Cristian and Wu, Qi and Gould, Stephen},
  journal={arXiv preprint arXiv:2004.02707},
  year={2020}
}

Contact

If you have any question regarding the dataset or publication, please create an issue in this repository or email to [email protected].

Code and data of the Fine-Grained R2R Dataset proposed in paper Sub-Instruction Aware Vision-and-Language Navigation

Related tags

Overview

Fine-Grained R2R

Data

Source

Reference

Contact

Owner

YicongHong

A clear, concise, simple yet powerful and efficient API for deep learning.

Few-Shot Graph Learning for Molecular Property Prediction

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Real-time ground filtering algorithm of cloud points acquired using Terrestrial Laser Scanner (TLS)

yolov5 deepsort 行人车辆跟踪检测计数

Constrained Language Models Yield Few-Shot Semantic Parsers

NCVX (NonConVeX): A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning.

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

[PyTorch] Official implementation of CVPR2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency". https://arxiv.org/abs/2103.05465

Official implementation for the paper "SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization".

Official implementation of "Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection" (ICCV Workshops 2021: RSL-CV).

MNIST, but with Bezier curves instead of pixels

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

PyTorch Implementation for "ForkGAN with SIngle Rainy NIght Images: Leveraging the RumiGAN to See into the Rainy Night"

Rede Neural Convolucional feita durante o processo seletivo do Laboratório de Inteligência Artificial da FACOM (UFMS)

Meta Learning for Semi-Supervised Few-Shot Classification

Existing Literature about Machine Unlearning

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Repository of best practices for deep learning in Julia, inspired by fastai

Code and data of the Fine-Grained R2R Dataset proposed in paper Sub-Instruction Aware Vision-and-Language Navigation

Related tags

Overview

Fine-Grained R2R

Data

Source

Reference

Contact

Owner

YicongHong

A clear, concise, simple yet powerful and efficient API for deep learning.

Few-Shot Graph Learning for Molecular Property Prediction

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Real-time ground filtering algorithm of cloud points acquired using Terrestrial Laser Scanner (TLS)

yolov5 deepsort 行人 车辆 跟踪 检测 计数

Constrained Language Models Yield Few-Shot Semantic Parsers

NCVX (NonConVeX): A User-Friendly and Scalable Package for Nonconvex Optimization in Machine Learning.

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

[PyTorch] Official implementation of CVPR2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency". https://arxiv.org/abs/2103.05465

Official implementation for the paper "SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization".

Official implementation of "Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection" (ICCV Workshops 2021: RSL-CV).

MNIST, but with Bezier curves instead of pixels

Deep deconfounded recommender (Deep-Deconf) for paper "Deep causal reasoning for recommendations"

PyTorch Implementation for "ForkGAN with SIngle Rainy NIght Images: Leveraging the RumiGAN to See into the Rainy Night"

Rede Neural Convolucional feita durante o processo seletivo do Laboratório de Inteligência Artificial da FACOM (UFMS)

Meta Learning for Semi-Supervised Few-Shot Classification

Existing Literature about Machine Unlearning

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Repository of best practices for deep learning in Julia, inspired by fastai

yolov5 deepsort 行人车辆跟踪检测计数