The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".

Last update: Oct 30, 2022

Overview

Code for "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval" (ACL 2021, Long)

This repository contains the baseline models and annotated data for the paper: Akari Asai and Eunsol Choi. Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval. In Proceedings of ACL, 2021.

In the paper, we carefully analyze unanswerable questions in information-seeking QA datasets (i.e., Natural Questions and TyDi QA) and attempt to identify the remaining headroom. We conduct both a range of controlled experiments and intensive human annotation on around 800 examples across 6 languages.

Annotated data

In human_annotated_data, we provide human-annotated data from TyDi QA and Natural Questions.

Dataset	language	# of annotated questions	file name
Natural Questions	English	450	NQ.tsv
TyDi QA	Bengali	50	TyDi-Bn.tsv
TyDi QA	Japanese	100	TyDi-Ja.tsv
TyDi QA	Korean	100	TyDi-Ko.tsv
TyDi QA	Russian	50	TyDi-Ru.tsv
TyDi QA	Telugu	50	TyDi-Te.tsv
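
The annotation files listed above are TSV files, so they can be inspected with the Python standard library. The following is a minimal sketch, not part of the repository: it assumes each file starts with a header row and uses the default CSV quoting rules, and the helper names `load_annotations` and `count_values` are hypothetical. The column names are not documented here, so the example prints them rather than assuming any.

```python
import csv
from collections import Counter
from pathlib import Path


def load_annotations(tsv_path):
    """Read one TSV file into a list of dicts keyed by its header row.

    Assumption: the first line of the file is a header row.
    """
    with open(tsv_path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f, delimiter="\t"))


def count_values(rows, column):
    """Count how often each value appears in the given column."""
    return Counter(row[column] for row in rows)


if __name__ == "__main__":
    # Path taken from the table above; adjust it to your local checkout.
    path = Path("human_annotated_data") / "NQ.tsv"
    if path.exists():
        rows = load_annotations(path)
        print(len(rows), "rows")
        # Inspect the real column names before relying on any of them.
        print(list(rows[0].keys()) if rows else "file has no data rows")
    else:
        print("File not found:", path)
```

Once the column names are known, `count_values(rows, "<column name>")` gives a quick distribution of the annotation categories in that column.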

Baselines

In this work, we conduct several baseline experiments to identify the remaining headroom in information-seeking QA. This repository includes the code for the question-only baseline. See the training and evaluation details in README.md. We thank the authors of Riki Net, Retro-reader, and ETC for providing their models' predictions, which we use to analyze the behavior of those state-of-the-art models.
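
To illustrate what a question-only baseline means in general, here is a toy sketch. It is not the repository's model: it swaps in a small multinomial naive Bayes classifier over question tokens, and it assumes the task is to predict a label such as answerable versus unanswerable from the question text alone, with no paragraph or document as input. The class name `QuestionOnlyClassifier` and the training examples are made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(question):
    """Lowercase and split on whitespace (deliberately simplistic)."""
    return question.lower().split()


class QuestionOnlyClassifier:
    """Toy multinomial naive Bayes over question tokens.

    A stand-in for illustration only, not the model used in the paper.
    """

    def fit(self, questions, labels):
        self.label_counts = Counter(labels)
        self.token_counts = defaultdict(Counter)
        self.vocab = set()
        for question, label in zip(questions, labels):
            tokens = tokenize(question)
            self.token_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, question):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # Add-one smoothing over the training vocabulary.
            denom = sum(self.token_counts[label].values()) + len(self.vocab)
            score = math.log(count / total)
            for token in tokenize(question):
                if token in self.vocab:  # skip tokens never seen in training
                    score += math.log((self.token_counts[label][token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


if __name__ == "__main__":
    # Made-up examples; real training data would come from NQ or TyDi QA.
    questions = [
        "who wrote hamlet",
        "who wrote macbeth",
        "why is the sky sad",
        "why is water angry",
    ]
    labels = ["answerable", "answerable", "unanswerable", "unanswerable"]
    model = QuestionOnlyClassifier().fit(questions, labels)
    print(model.predict("who wrote othello"))
```

The point of the sketch is the input restriction: the classifier never sees any evidence text, so whatever accuracy it reaches comes from the question wording alone.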

Citation and Contact

If you find this codebase useful or use it in your work, please cite our paper.

@inproceedings{
asai2020learning,
title={Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval},
author={Akari Asai and Eunsol Choi},
booktitle={ACL-IJCNLP},
year={2021}
}

Please contact Akari Asai (@AkariAsai, akari[at]cs.washington.edu) for questions and suggestions.
