[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Last update: Jan 06, 2023

Overview

REval

Introduction
Overview
Requirements
Installation
Probing
Usage
Citation
License

🎓 Introduction

REval is a simple framework for probing sentence-level representations of Relation Extraction models.

✅ Requirements

REval is tested with:

Python 3.7

🚀 Installation

With pip

<TBD>

From source

git clone https://github.com/DFKI-NLP/REval
cd REval
pip install -r requirements.txt

🔬 Probing

Supported Datasets

SemEval 2010 Task 8 (CoreNLP annotated version) [LINK]
TACRED (obtained via LDC) [LINK]

Probing Tasks

Task	SemEval 2010	TACRED
ArgTypeHead	✔️	✔️
ArgTypeTail	✔️	✔️
Length	✔️	✔️
EntityDistance	✔️	✔️
ArgumentOrder		✔️
EntityExistsBetweenHeadTail	✔️	✔️
PosTagHeadLeft	✔️	✔️
PosTagHeadRight	✔️	✔️
PosTagTailLeft	✔️	✔️
PosTagTailRight	✔️	✔️
TreeDepth	✔️	✔️
SDPTreeDepth	✔️	✔️
ArgumentHeadGrammaticalRole	✔️	✔️
ArgumentTailGrammaticalRole	✔️	✔️

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

python reval.py generate-all-from-semeval \
    --train-file <SEMEVAL DIR>/train.json \
    --validation-file <SEMEVAL DIR>/dev.json \
    --test-file <SEMEVAL DIR>/test.json \
    --output-dir ./data/semeval/

TACRED

python reval.py generate-all-from-tacred \
    --train-file <TACRED DIR>/train.json \
    --validation-file <TACRED DIR>/dev.json \
    --test-file <TACRED DIR>/test.json \
    --output-dir ./data/tacred/

Step 2: Run the probing tasks on a model.

For example, download a Relation Extraction model trained with RelEx, e.g., the CNN trained on SemEval.

mkdir -p models/cnn_semeval
wget --content-disposition https://cloud.dfki.de/owncloud/index.php/s/F3gf9xkeb2foTFe/download -P models/cnn_semeval

python probing_task_evaluation.py \
    --model-dir ./models/cnn_semeval/ \
    --data-dir ./data/semeval/ \
    --dataset semeval2010 \
    --cuda-device 0 \
    --batch-size 64 \
    --cache-representations

After the run is completed, the results are stored to probing_task_results.json in the model-dir.

{
    "ArgTypeHead": {
        "acc": 75.82,
        "devacc": 78.96,
        "ndev": 670,
        "ntest": 2283
    },
    "ArgTypeTail": {
        "acc": 75.4,
        "devacc": 78.79,
        "ndev": 627,
        "ntest": 2130
    },
    [...]
}

📚 Citation

If you use REval, please consider citing the following paper:

@inproceedings{alt-etal-2020-probing,
    title={Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction},
    author={Christoph Alt and Aleksandra Gabryszak and Leonhard Hennig},
    year={2020},
    booktitle={Proceedings of ACL},
    url={https://arxiv.org/abs/2004.08134}
}

📘 License

REval is released under the terms of the MIT License.

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Related tags

Overview

REval

Table of Contents

🎓 Introduction

✅ Requirements

🚀 Installation

With pip

From source

🔬 Probing

Supported Datasets

Probing Tasks

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

TACRED

Step 2: Run the probing tasks on a model.

📚 Citation

📘 License

Owner

Unofficial PyTorch Implementation of "Augmenting Convolutional networks with attention-based aggregation"

This is the code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".

Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

OpenLT: An open-source project for long-tail classification

This thesis is mainly concerned with state-space methods for a class of deep Gaussian process (DGP) regression problems

A deep neural networks for images using CNN algorithm.

Real-Time Semantic Segmentation in Mobile device

This solves the autonomous driving issue which is supported by deep learning technology. Given a video, it splits into images and predicts the angle of turning for each frame.

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"

Omnidirectional camera calibration in python

Portfolio analytics for quants, written in Python

darija <-> english dictionary

GluonMM is a library of transformer models for computer vision and multi-modality research

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

A transformer-based method for Healthcare Image Captioning in Vietnamese

The Submission for SIMMC 2.0 Challenge 2021

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.

Molecular AutoEncoder in PyTorch