HIVE: Evaluating the Human Interpretability of Visual Explanations

Last update: Dec 13, 2022

Related tags

Overview

HIVE: Evaluating the Human Interpretability of Visual Explanations

Project Page | Paper

This repo provides the code for HIVE, a human evaluation framework for interpretability methods in computer vision.

@article{kim2021hive,
  author = {Sunnie S. Y. Kim and Nicole Meister and Vikram V. Ramaswamy and Ruth Fong and Olga Russakovsky},
  title = {{HIVE}: Evaluating the Human Interpretability of Visual Explanations},
  journal = {CoRR},
  volume = {abs/2112.03184},
  year = {2021}
}

Our study UIs

Distinction task

combined_gradcam_nolabels.html
combined_bagnet_nolabels.html
combined_protopnet_distinction.html
combined_prototree_distinction.html

Agreement task

combined_protopnet_agreement.html
combined_prototree_agreement.html

Additional studies

combined_gradcam_labels.html
combined_bagnet_labels.html
combined_prototree_agreement_tree.html

Running human studies

We ran our studies through Human Intelligence Tasks (HITs) deployed on Amazon Mechanical Turk (AMT). We use simple-amt, a microframework for working with AMT. Here we describe which files correspond to which study UIs and provide brief instructions for running studies.

Brief instructions on how to run user studies on AMT

Please check out the original simple-amt repository for more information on how to run a HIT on AMT.

Launch HITs on AMT

python launch_hits.py \
--html_template=hit_templates/combined_prototree_distinction.html \
--hit_properties_file=hit_properties/properties.json \
--input_json_file=examples/input_prototree_distinction.txt \
--hit_ids_file=examples/hit_ids_prototree_distinction.txt --prod

Check HIT progress

python show_hit_progress.py \
--hit_ids_file=examples/hit_ids_prototree_distinction.txt --prod

Get results

python get_results.py \
  --hit_ids_file=examples/hit_ids_prototree_distinction.txt \
  --output_file=examples/results_prototree_distinction.txt \
  > examples/results_prototree_distinction.txt --prod

Approve work

python approve_hits.py \
--hit_ids_file=examples/hit_ids_prototree_distinction.txt --prod

HIVE: Evaluating the Human Interpretability of Visual Explanations

Related tags

Overview

HIVE: Evaluating the Human Interpretability of Visual Explanations

Project Page | Paper

Our study UIs

Distinction task

Agreement task

Additional studies

Running human studies

Brief instructions on how to run user studies on AMT

Launch HITs on AMT

Check HIT progress

Get results

Approve work

Owner

Princeton Visual AI Lab

Implementation of PersonaGPT Dialog Model

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

Automatic tool focused on deriving metallicities of open clusters

PiRank: Learning to Rank via Differentiable Sorting

MetaDrive: Composing Diverse Scenarios for Generalizable Reinforcement Learning

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement

The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Text-to-SQL"

JumpDiff: Non-parametric estimator for Jump-diffusion processes for Python

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

An Abstract Cyber Security Simulation and Markov Game for OpenAI Gym

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Convert Pytorch model to onnx or tflite, and the converted model can be visualized by Netron

Weakly Supervised Segmentation by Tensorflow.

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

Simple data balancing baselines for worst-group-accuracy benchmarks.

Google Landmark Recogntion and Retrieval 2021 Solutions

AdelaiDepth is an open source toolbox for monocular depth prediction.