Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

TF Image Segmentation: Image Segmentation framework

The source codes for TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation.

Near-Duplicate Video Retrieval with Deep Metric Learning

SNE-RoadSeg in PyTorch, ECCV 2020

A state of the art of new lightweight YOLO model implemented by TensorFlow 2.

Mini-hmc-jax - A simple implementation of Hamiltonian Monte Carlo in JAX

reimpliment of DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation

A computer vision pipeline to identify the "icons" in Christian paintings

Gans-in-action - Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

Mememoji - A facial expression classification system that recognizes 6 basic emotions: happy, sad, surprise, fear, anger and neutral.

Leaderboard and Visualization for RLCard

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)

Constructing interpretable quadratic accuracy predictors to serve as an objective function for an IQCQP problem that represents NAS under latency constraints and solve it with efficient algorithms.

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.

An official PyTorch Implementation of Boundary-aware Self-supervised Learning for Video Scene Segmentation (BaSSL)

Repository accompanying the "Sign Pose-based Transformer for Word-level Sign Language Recognition" paper

PyTorch Implementation of Temporal Output Discrepancy for Active Learning, ICCV 2021

This is an official source code for implementation on Extensive Deep Temporal Point Process

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file