The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models

Last update: Dec 14, 2022

Graformer

Graformer (also named BridgeTransformer in the code) is a sequence-to-sequence model designed mainly for neural machine translation. It improves multilingual translation by grafting separately pre-trained language models: a pre-trained masked language model as the encoder (BERT) and a pre-trained language model as the decoder (GPT). The code is based on Fairseq.

Examples

You can start with run/run.sh, which needs only minor modification. The script for each step is:

pre-train the multilingual BERT:
    run_arnold_multilingual_masked_lm_6e6d.sh

pre-train the multilingual GPT:
    run_arnold_multilingual_lm_6e6d.sh

train a Graformer:
    run_arnold_multilingual_graft_transformer_12e12d_ted.sh

run inference with a Graformer:
    run_arnold_multilingual_graft_inference_ted.sh
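
As a rough sketch of how the steps above map to scripts, the helper below builds the shell command for each one. The script file names are taken from this README. The stage keys, the `build_command` function, and the assumption that the scripts sit in the `run/` directory next to `run.sh` are illustrative only, so check them against your checkout.

```python
from pathlib import PurePosixPath

# Script names as listed in this README.
# The stage keys on the left are hypothetical labels chosen for this sketch.
SCRIPTS = {
    "pretrain_bert": "run_arnold_multilingual_masked_lm_6e6d.sh",
    "pretrain_gpt": "run_arnold_multilingual_lm_6e6d.sh",
    "train_graformer": "run_arnold_multilingual_graft_transformer_12e12d_ted.sh",
    "inference": "run_arnold_multilingual_graft_inference_ted.sh",
}


def build_command(stage, run_dir="run"):
    """Return the argv list that would launch the script for `stage`.

    `run_dir` defaults to "run", which is an assumption about where the
    scripts live; pass another directory if your layout differs.
    """
    if stage not in SCRIPTS:
        raise KeyError(f"unknown stage {stage!r}; choose from {sorted(SCRIPTS)}")
    return ["bash", str(PurePosixPath(run_dir) / SCRIPTS[stage])]


if __name__ == "__main__":
    # Print the commands only; nothing is executed here.
    for stage in SCRIPTS:
        print(stage, "->", " ".join(build_command(stage)))
```

The function only assembles the command, so it is safe to run anywhere. Pass the result to `subprocess.run` once the paths and any cluster-specific settings inside the scripts have been adapted.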

Released Models

We release our pre-trained mBERT and mGPT, along with the trained Graformer model, here.

Tensorflow Version

We will provide the TensorFlow version in NeurST, a popular toolkit for sequence processing.

Citation

Please cite as:

@inproceedings{sun2021mulilingual,
    title = "Multilingual Translation via Grafting Pre-trained Language Models",
    author = "Sun, Zewei and Wang, Mingxuan and Li, Lei",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
    year = "2021"
}

Contact

If you have any questions, please feel free to contact me: [email protected]
