Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

TensorFlow Tutorials with YouTube Videos

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Data Consistency for Magnetic Resonance Imaging

This is the official implementation for the paper "(Almost) Free Incentivized Exploration from Decentralized Learning Agents" in NeurIPS 2021.

[CVPR'21] FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space

Official PyTorch implementation of Spatial Dependency Networks.

Implementation of paper: "Image Super-Resolution Using Dense Skip Connections" in PyTorch

People log into different sites every day to get information and browse through these sites one by one

A small tool to joint picture including gif

Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

RAANet: Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Density Level Estimation

A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)

Public Implementation of ChIRo from "Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations"

CityLearn Challenge Multi-Agent Reinforcement Learning for Intelligent Energy Management, 2020, PikaPika team

Official implementation of "A Unified Objective for Novel Class Discovery", ICCV2021 (Oral)

Lipschitz-constrained Unsupervised Skill Discovery

Build upon neural radiance fields to create a scene-specific implicit 3D semantic representation, Semantic-NeRF

Kaggle-titanic - A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.

Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.