Answering Open-Domain Questions of Varying Reasoning Steps from Text

Last update: Dec 22, 2022

Related tags

Overview

IRRR

This repository contains the authors' implementation of the Iterative Retriever, Reader, and Reranker (IRRR) model in the EMNLP 2021 paper "Answering Open-Domain Questions of Varying Reasoning Steps from Text".

Run prediction on BeerQA

Setting up

Checkout the code from our repository using

git clone https://github.com/beerqa/IRRR.git

This repo requires Python 3.6. Please check your shell environment's python before proceeding. To use ElasticSearch, make sure you also install Java Development Kit (JDK) version 8.

The setup script will download all required dependencies (python requirements, data, models, etc.) required to run the IRRR pipeline end-to-end. Before running this script, make sure you have the Unix utility wget (which can be installed through anaconda as well as other common package managers). Along the way, it will also start running Elasticsearch and index the Wikipedia corpus locally.

Note: This might take a while to finish and requires a large amount of disk space, so it is strongly recommended that you run this on a machine with at least 100GB of free disk space.

bash setup.sh

Run prediction

Here is a quick example for running prediction using our trained model on the BeerQA dataset. It will take hours depending on the number of retrieved passages at each iteration. It requires up-to 100GB of storage for storing intermediate files when are large number of passages are retrieved at each reasoning step

bash scripts/predict_dynamic_hops.sh PREDICT_OUTPUT_PATH \
                                     ./data/beerqa/beerqa_dev_v1.0.json \
				     MODEL_OUTPUT_PATH \
				     NUM_PASSAGES_AT_EACH_ITERATION \
				     MAX_ITERATION

Evaluate the prediction

Once the prediction has been made, you can use the following command to evaluate the output

python utils/eval_beerqa.py ./data/beerqa/beerqa_dev_v1.0.json \
                            PREDICT_OUTPUT_PATH/answer_predictions.json

Citation

If you use IRRR in your work, please consider citing our paper

@inproceedings{qi2021answering,
  title={Answering Open-Domain Questions of Varying Reasoning Steps from Text},
  author = {Qi, Peng and Lee, Haejun and Sido, Oghenetegiri "TG" and Manning, Christopher D.},
  booktitle = {Empirical Methods for Natural Language Processing ({EMNLP})},
  year = {2021}
}

License

All work contained in this package is licensed under the Apache License, Version 2.0. See the included LICENSE file.

Answering Open-Domain Questions of Varying Reasoning Steps from Text

Related tags

Overview

IRRR

Run prediction on BeerQA

Setting up

Run prediction

Evaluate the prediction

Citation

License

Owner

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

Official implementation of the paper "Lightweight Deep CNN for Natural Image Matting via Similarity Preserving Knowledge Distillation"

Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions

Code for “ACE-HGNN: Adaptive Curvature ExplorationHyperbolic Graph Neural Network”

Official Pytorch implementation of Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations

Corruption Invariant Learning for Re-identification

Adaptive FNO transformer - official Pytorch implementation

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network

It is a simple library to speed up CLIP inference up to 3x (K80 GPU)

NLMpy - A Python package to create neutral landscape models

Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch

The Environment I built to study Reinforcement Learning + Pokemon Showdown

ROMP: Monocular, One-stage, Regression of Multiple 3D People, ICCV21

Implementation for the EMNLP 2021 paper "Interactive Machine Comprehension with Dynamic Knowledge Graphs".

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

Demonstrational Session git repo for H SAF User Workshop (28/1)

A PyTorch implementation of unsupervised SimCSE

Rename Images with Auto Generated Neural Image Captions

Distance correlation and related E-statistics in Python