JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Last update: Oct 26, 2022

Overview

This is the repository for the paper "JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation" [1].

Find extensions of this work and new pre-trained models here: code, paper

Requirements

Install OpenNMT-py (1.0) and subword-nmt.

pip install OpenNMT-py
pip install subword-nmt

Pre-trained JASS models

We release JASS models for two language pairs: ja+en and ja+ru. For Japanese seq2seq pre-training we use our proposed JASS methods, while MASS is used for English and Russian.

Model	Vocabulary	BPE codes
JASS-jaen	ja-en	ja-en.bpe.codes
JASS-jaru	ja-ru	ja-ru.bpe.codes

Usage

Run the BPE preprocessing on the dataset you want to fine-tune on, using the downloaded BPE codes. Then run OpenNMT's preprocess.py, supplying the downloaded vocabulary for both the source and target sides. Finally, fine-tune the pre-trained model by passing it to OpenNMT's train.py through the train_from argument.
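The three steps above can be sketched as a short script. This is a minimal sketch under stated assumptions, not the authors' exact recipe: every file name and directory is a placeholder, the same downloaded vocabulary file is assumed to serve both source and target, and the input text is assumed to be tokenized the same way as the pre-training data. It uses the onmt_preprocess and onmt_train commands that a pip install of OpenNMT-py 1.0 provides as counterparts of preprocess.py and train.py. By default the script only prints the commands; set DRY_RUN=0 to execute them.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Placeholder paths (assumptions): point these at your own files.
DATA=${DATA:-data}                      # directory with train/valid text
OUT=${OUT:-finetune}                    # output directory
CODES=${CODES:-ja-en.bpe.codes}         # downloaded BPE codes
VOCAB=${VOCAB:-ja-en.vocab}             # downloaded vocabulary (hypothetical name)
PRETRAINED=${PRETRAINED:-JASS-jaen.pt}  # downloaded checkpoint (hypothetical name)
DRY_RUN=${DRY_RUN:-1}                   # 1 = print commands only

# Print a command, and execute it unless DRY_RUN is 1.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" != "1" ]; then "$@"; fi
}

# 1. Apply the downloaded BPE codes to every file of the fine-tuning corpus.
for f in train.ja train.en valid.ja valid.en; do
  run subword-nmt apply-bpe -c "$CODES" -i "$DATA/$f" -o "$DATA/$f.bpe"
done

# 2. Build the OpenNMT dataset with the downloaded vocabulary on both sides.
run onmt_preprocess \
  -train_src "$DATA/train.ja.bpe" -train_tgt "$DATA/train.en.bpe" \
  -valid_src "$DATA/valid.ja.bpe" -valid_tgt "$DATA/valid.en.bpe" \
  -src_vocab "$VOCAB" -tgt_vocab "$VOCAB" \
  -save_data "$OUT/data"

# 3. Fine-tune, initializing from the pre-trained checkpoint.
run onmt_train -data "$OUT/data" -train_from "$PRETRAINED" \
  -save_model "$OUT/jass-finetuned"
```

Running the script once in dry-run mode is a cheap way to check the paths before starting a long training job. Training options such as batch size or learning rate are deliberately left out here and would need to be chosen for the dataset at hand.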

Others

We will update the current Japanese--English pre-trained model and release pretrained models on Japanese--Chinese and Japanese--Korean. We released new models here: code

Reference

[1] Zhuoyuan Mao, Fabien Cromieres, Raj Dabre, Haiyue Song, Sadao Kurohashi, JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

@inproceedings{mao-etal-2020-jass,
    title = "{JASS}: {J}apanese-specific Sequence to Sequence Pre-training for Neural Machine Translation",
    author = "Mao, Zhuoyuan  and
      Cromieres, Fabien  and
      Dabre, Raj  and
      Song, Haiyue  and
      Kurohashi, Sadao",
    booktitle = "Proceedings of The 12th Language Resources and Evaluation Conference",
    month = may,
    year = "2020",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://www.aclweb.org/anthology/2020.lrec-1.454",
    pages = "3683--3691",
    language = "English",
    ISBN = "979-10-95546-34-4",
}

Owner

Zhuoyuan Mao
