Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Last update: Oct 25, 2022

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Idea: How do we interpret what a language model learns at various stages of training? Language models have been recently described as open knowledge bases. We can generate knowledge graphs by extracting relation triples from masked language models at sequential epochs or architecture variants to examine the knowledge acquisition process.

Dataset: Squad, Google-RE (3 flavors)

Models: BERT, RoBeRTa, DistilBert, training RoBERTa from scratch

Authors: Vinitra Swamy, Angelika Romanou, Martin Jaggi

This repository is the official implementation of the NeurIPS 2021 XAI4Debugging paper titled "Interpreting Language Models Through Knowledge Graph Extraction". Found this work useful? Please cite our paper.

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Install requirements and clone repository

git clone https://github.com/epfml/interpret-lm-knowledge.git
pip install git+https://github.com/huggingface/transformers   
pip install textacy
cd interpret-lm-knowledge/scripts

Generate knowledge graphs and dataframes python run_knowledge_graph_experiments.py <dataset> <model> <use_spacy>
e.g. squad Bert spacy
e.g. re-place-birth Roberta

options:

dataset=squad - "squad", "re-place-birth", "re-date-birth", "re-place-death"  
model=Roberta - "Bert", "Roberta", "DistilBert"  
extractor=spacy - "spacy", "textacy", "custom"

See run_lm_experiments notebook for examples.

Train LM model from scratch -> Knowledge Graph

Install requirements and clone repository

!pip install git+https://github.com/huggingface/transformers
!pip list | grep -E 'transformers|tokenizers'
!pip install textacy

Run wikipedia_train_from_scratch_lm.ipynb.
As included in the last cell of the notebook, you can run the KG generation experiments by:

from run_training_kg_experiments import *
run_experiments(tokenizer, model, unmasker, "Roberta3e")

Citations

@inproceedings{swamy2021interpreting,
 author = {Swamy, Vinitra and Romanou, Angelika and Jaggi, Martin},
 booktitle = {Advances in Neural Information Processing Systems, Workshop on eXplainable AI Approaches for Debugging and Diagnosis},
 title = {Interpreting Language Models Through Knowledge Graph Extraction},
 volume = {35},
 year = {2021}
}

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Related tags

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Train LM model from scratch -> Knowledge Graph

Citations

Owner

EPFL Machine Learning and Optimization Laboratory

Let's Git - Versionsverwaltung & Open Source Hausaufgabe

Investigating Attention Mechanism in 3D Point Cloud Object Detection (arXiv 2021)

Model serving at scale

Mscp jamf - Build compliance in jamf

Code for "Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification", ECCV 2020 Spotlight

Shuffle Attention for MobileNetV3

GND-Nets (Graph Neural Diffusion Networks) in TensorFlow.

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"

SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)

An Inverse Kinematics library aiming performance and modularity

Active learning for Mask R-CNN in Detectron2

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Pytorch Implementation of "Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation"

App for identification of various objects. Based on YOLO v4 tiny architecture

Deal or No Deal? End-to-End Learning for Negotiation Dialogues

Feup-csr - Repository holding my group's submission to the CSR project competition

[CVPR 2022] Official PyTorch Implementation for "Reference-based Video Super-Resolution Using Multi-Camera Video Triplets"

Code for PhySG: Inverse Rendering with Spherical Gaussians for Physics-based Relighting and Material Editing

Oriented Object Detection: Oriented RepPoints + Swin Transformer/ReResNet