Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Last update: Sep 09, 2022

Description

This is the repository for the paper Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources, presented at NAACL 2021 by Simone Conia, Andrea Bacciu, and Roberto Navigli.

Abstract

While cross-lingual techniques are finding increasing success in a wide range of Natural Language Processing tasks, their application to Semantic Role Labeling (SRL) has been strongly limited by the fact that each language adopts its own linguistic formalism, from PropBank for English to AnCora for Spanish and PDT-Vallex for Czech, inter alia. In this work, we address this issue and present a unified model to perform cross-lingual SRL over heterogeneous linguistic resources. Our model implicitly learns a high-quality mapping for different formalisms across diverse languages without resorting to word alignment and/or translation techniques. We find that our cross-lingual system is not only competitive with the current state of the art but also robust to low-data scenarios. Most interestingly, our unified model is able to annotate a sentence in a single forward pass with all the inventories it was trained with, providing a tool for the analysis and comparison of linguistic theories across different languages.

Download

You can download a copy of all the files in this repository by cloning the git repository:

git clone https://github.com/SapienzaNLP/unify-srl.git

or download a zip archive.

Model Checkpoint

Link to Drive

To install

To install the dependencies, use the provided environment.yml (for example, with conda).
To run the model on NVIDIA CUDA, remember to install the torch-scatter build that matches your CUDA version, as described in the torch-scatter documentation.
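
After installing, it can help to confirm that the environment has what the note above asks for. The following is a minimal sketch and not part of the repository: the helper names `is_installed` and `check_environment` are hypothetical, and it assumes the packages are importable as `torch` and `torch_scatter`. It only reports what is present; it does not install anything.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located (without importing it)."""
    return importlib.util.find_spec(module_name) is not None


def check_environment() -> dict:
    """Report whether torch and torch_scatter are present and whether CUDA is visible.

    Hypothetical helper for illustration; the module names are assumptions.
    """
    report = {
        "torch": is_installed("torch"),
        "torch_scatter": is_installed("torch_scatter"),
        "cuda_available": False,
    }
    if report["torch"]:
        # Imported lazily so the check still runs when torch is missing.
        import torch

        report["cuda_available"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {ok}")
```

If `torch` is reported as present but `torch_scatter` or `cuda_available` is `False`, the torch-scatter build may not match the installed PyTorch/CUDA combination, which is the case the note above warns about.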

Cite this work

@inproceedings{conia-etal-2021-unify-srl,
    title = "Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources",
    author = "Conia, Simone  and
      Bacciu, Andrea  and
      Navigli, Roberto",
    booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jun,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2021.naacl-main.31",
    pages = "338--351",
}

Owner

Sapienza NLP group
