Measuring whether attention is explanation with ROAR

Overview

NLP ROAR Interpretability

Official code for: Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining

[Figure: ROAR and Recursive ROAR faithfulness curves]

Install

git clone https://github.com/AndreasMadsen/nlp-roar-interpretability.git
cd nlp-roar-interpretability
python -m pip install -e .

Experiments

Tasks

There are scripts for each dataset. Note that some tasks share a dataset. Use this list to identify how to train a model for each task; an example invocation follows the list.

  • SST: python experiments/stanford_sentiment.py
  • SNLI: python experiments/stanford_nli.py
  • IMDB: python experiments/imdb.py
  • MIMIC (Diabetes): python experiments/mimic.py --subset diabetes
  • MIMIC (Anemia): python experiments/mimic.py --subset anemia
  • bAbI-1: python experiments/babi.py --task 1
  • bAbI-2: python experiments/babi.py --task 2
  • bAbI-3: python experiments/babi.py --task 3
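
As a concrete example, the following trains a single SST model; --seed is described under Parameters below, and any flag not given falls back to the script's defaults:

python experiments/stanford_sentiment.py --seed 0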

Parameters

Each of the above scripts (stanford_sentiment, stanford_nli, imdb, mimic, and babi) takes the same set of CLI arguments. You can learn about each argument with --help. The most important arguments, which allow you to run the experiments presented in the paper, are:

  • --importance-measure: this specifies which importance measure is used. It can be either random, mutual-information, attention, gradient, or integrated-gradient.
  • --seed: specifies the seed used to initialize the model.
  • --roar-strategy: whether ROAR masking should be absolute (count) or relative (quantile).
  • --k: the proportion of tokens (in %) to mask if --roar-strategy quantile is used, or the number of tokens to mask if --roar-strategy count is used.
  • --recursive: indicates that the model used for computing the importance measure was trained with --k set to --k minus --recursive-step-size, instead of the --k 0 model used in classic ROAR.

Note that for --k > 0 the reference model must already be trained. In the non-recursive case, for example, this means that a model trained with --k 0 must already be available; see the sketch below.
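
As a minimal sketch of the classic (non-recursive) ROAR workflow on SST, using the quantile strategy and attention as the importance measure (exact defaults and required flag combinations beyond those documented above are assumptions, so check --help first):

# 1. Train the baseline model with no masking.
python experiments/stanford_sentiment.py --seed 0 --k 0
# 2. Mask the allegedly most important 10% of tokens and retrain.
python experiments/stanford_sentiment.py --seed 0 --importance-measure attention --roar-strategy quantile --k 10
# 3. Repeat for 20%, 30%, ...; in classic ROAR every run only needs the --k 0 baseline.

For Recursive ROAR, add --recursive to each run; the importance measure is then computed by the model trained at --k minus --recursive-step-size, so the runs must be executed in increasing order of --k.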

Running on a HPC setup

We provide a download.sh script for downloading dataset dependencies.

Additionally, we provide scripts in batch_jobs/ for submitting all jobs to a Slurm queue. Note again that the ROAR scripts assume checkpoints exist for the baseline --k 0 models.

The jobs automatically use $SCRATCH/nlproar as the persistent directory.
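
A typical end-to-end HPC workflow might look like the following; the job script name under batch_jobs/ is hypothetical, so check that directory for the actual file names:

export SCRATCH=/path/to/scratch           # checkpoints and datasets go in $SCRATCH/nlproar
bash download.sh                          # fetch dataset dependencies
sbatch batch_jobs/stanford_sentiment.sh   # hypothetical name; submit the baseline (--k 0) jobs first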

MIMIC

See https://mimic.physionet.org/gettingstarted/access/ for how to get access to MIMIC. You will need to download DIAGNOSES_ICD.csv.gz and NOTEEVENTS.csv.gz and place them in mimic/ relative to your persistent directory.
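
Assuming $SCRATCH/nlproar is your persistent directory (see the HPC section above), placing the files would look like this:

mkdir -p $SCRATCH/nlproar/mimic
cp DIAGNOSES_ICD.csv.gz NOTEEVENTS.csv.gz $SCRATCH/nlproar/mimic/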

Owner
Andreas Madsen
Researching interpretability for Machine Learning because society needs it.