Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Last update: Oct 18, 2022

Overview

Graph Convolution Simulator (GCS)

Requirements:

PyTorch and DGL should be installed based on your system. For other libraries, you can install them using the following command:

$ pip install -r requirements.txt

Run Knowledge Integration Interpretation (KI) by GCS on example data:

$ bash run_example.sh

Interpretation results are saved in ./example/example_data/gcs.edgelist.

If the knowledge graph is small, users can visualize the results in ./example/example_data/results.pdf.

Run Knowledge Integration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Store them in PyTorch tensor (.pt) format. Make sure both tensors have the same number of rows and that each row index refers to the same entity in both. The default file names are emb_roberta.pt and emb_kadapter.pt.
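The two conditions above can be checked before running GCS. The following is a minimal sketch, not part of the repository: check_alignment is a hypothetical helper, the shapes are assumed to come from something like torch.load(path).shape, the embedding width 768 is only an illustrative value, and qid2idx is the index dictionary described in Step 2.

```python
def check_alignment(shape_vlm, shape_klm, qid2idx):
    """Return a list of problems; an empty list means the inputs line up.

    shape_vlm / shape_klm: (rows, dim) shapes of the two embedding tensors,
    e.g. obtained via torch.load("emb_roberta.pt").shape (assumption).
    qid2idx: mapping from entity Q-label to row index.
    """
    problems = []
    if shape_vlm[0] != shape_klm[0]:
        problems.append(f"row mismatch: {shape_vlm[0]} vs {shape_klm[0]}")

    n_rows = min(shape_vlm[0], shape_klm[0])
    indices = list(qid2idx.values())

    # Two entities must not share one embedding row.
    if len(set(indices)) != len(indices):
        problems.append("duplicate entity indices in qid2idx")

    # Every index must point at an existing row in both tensors.
    out_of_range = [i for i in indices if not 0 <= i < n_rows]
    if out_of_range:
        problems.append(f"{len(out_of_range)} indices outside [0, {n_rows})")
    return problems


# Illustrative values only: three entities, 768-dimensional embeddings.
ok = check_alignment((3, 768), (3, 768), {"Q1": 0, "Q2": 1, "Q3": 2})
bad = check_alignment((3, 768), (4, 768), {"Q1": 0, "Q2": 5})
```

Here ok is an empty list, while bad reports both the row mismatch and the out-of-range index. The check only looks at shapes and indices; it cannot tell whether row i really holds the same entity in both files, which still depends on how the embeddings were exported.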

Step 2: Prepare the knowledge graph:

Three files are needed to load the knowledge graph:

a) qid2idx.json: The index dictionary. The key is the entity Q-label, and the value is the index of the entity in the entity embedding.
b) qid2label.json: The label dictionary. The key is the entity Q-label, and the value is the entity label text. This dictionary is used only for visualization; you can set it to {Q-label: Q-label} if you don't have the text.
c) kg.edgelist: The knowledge triples used to construct the knowledge graph. Each row holds one triple as: entity1_idx \t entity2_idx \t {}.
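As an illustration of the three files, the sketch below writes them for a toy graph using only the Python standard library. write_kg_files is a hypothetical helper, not part of the repository, and the Q-labels, label text and triples are made up; it assumes that row i of both embedding tensors belongs to the i-th Q-label in the list.

```python
import json
import tempfile
from pathlib import Path


def write_kg_files(data_dir, qids, triples, labels=None):
    """Write qid2idx.json, qid2label.json and kg.edgelist into data_dir."""
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    labels = labels or {}

    # Assumption: row i of the embedding tensors belongs to qids[i].
    qid2idx = {qid: i for i, qid in enumerate(qids)}
    # Fall back to the Q-label itself when no label text is available.
    qid2label = {qid: labels.get(qid, qid) for qid in qids}

    (data_dir / "qid2idx.json").write_text(json.dumps(qid2idx))
    (data_dir / "qid2label.json").write_text(json.dumps(qid2label))

    # One triple per row: entity1_idx <TAB> entity2_idx <TAB> {}
    rows = [f"{qid2idx[h]}\t{qid2idx[t]}\t{{}}" for h, t in triples]
    (data_dir / "kg.edgelist").write_text("\n".join(rows) + "\n")
    return qid2idx


# Toy data: the Q-labels, label text and triples below are invented.
demo_dir = Path(tempfile.mkdtemp())
demo_qid2idx = write_kg_files(
    demo_dir,
    qids=["Q1", "Q2", "Q3"],
    triples=[("Q1", "Q2"), ("Q2", "Q3")],
    labels={"Q1": "entity one"},
)
```

Deriving the edge list from qid2idx, rather than writing indices by hand, keeps the triples consistent with the embedding row order.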

Step 3: Run GCS for KI interpretation:

After two preparation steps, you can run GCS by:

$ python src/example.py --emb_vlm emb_roberta.pt --emb_klm emb_kadapter.pt --data_dir ./example_data --lr 1e-3 --loss mi_loss

The available hyperparameters are defined in ./example/src/example.py. For large knowledge graphs, we recommend using the mutual information loss (mi_loss) and skipping visualization of the results.

Step 4: Analyze GCS interpretation results:

The interpretation results are saved in ./example/example_data/gcs.edgelist. Each row holds one triple as: entity1_idx \t entity2_idx \t {'a': xxxx}. Here, the value of 'a' is the attention coefficient on the triple/entity (entity1, r, entity2). Users can use these values to analyze the factual knowledge learned during knowledge integration.
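A minimal sketch of reading rows in this format and ranking triples by their coefficient is shown below. parse_gcs_edgelist and top_k are hypothetical helpers, not part of the repository, and the sample rows are made up rather than real GCS output. The parser splits on any whitespace, on the assumption that the first two columns never contain spaces.

```python
import ast


def parse_gcs_edgelist(text):
    """Parse rows of the form: entity1_idx, entity2_idx, {'a': value}."""
    edges = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on tabs or spaces, keeping the dict column in one piece.
        head, tail, attrs = line.split(None, 2)
        # The third column is a Python dict literal, so literal_eval reads it.
        coeff = float(ast.literal_eval(attrs)["a"])
        edges.append((int(head), int(tail), coeff))
    return edges


def top_k(edges, k):
    """Return the k triples with the largest attention coefficient."""
    return sorted(edges, key=lambda e: e[2], reverse=True)[:k]


# Made-up rows in the documented format, not real GCS output.
sample = "0\t1\t{'a': 0.12}\n1\t2\t{'a': 0.87}\n0\t2\t{'a': 0.45}\n"
edges = parse_gcs_edgelist(sample)
strongest = top_k(edges, 2)
```

For a real run, the text would come from reading gcs.edgelist, and the indices can be mapped back to Q-labels by inverting qid2idx.json.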

Reproduce the results in the paper

See the ./all_exp folder for more details.

Cite

If you use the code, please cite the paper:

@article{hou2022understanding,
  title={Understanding Knowledge Integration in Language Models with Graph Convolutions},
  author={Hou, Yifan and Fu, Guoji and Sachan, Mrinmaya},
  journal={arXiv preprint arXiv:2202.00964},
  year={2022}
}

Contact

Feel free to open an issue or send me an email if you have any questions!
