Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Last update: Oct 14, 2022

Overview

About this repository

This repo contains an Pytorch implementation for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. The code framework is based on TextBox.

Environment

python >= 3.8.11
torch >= 1.6.0

Run install.sh to install other requirements.

Dataset

The processed dataset can be downloaded from Google Drive. Once finished, unzip the datafiles (train.src, train.tgt, ...) to ./data.

An overview of dataset: train: 287113 cases, dev: 13368 cases, test: 11490 cases

Paramters

# overall settings
data_path: 'data/'
checkpoint_dir: 'saved/'
generated_text_dir: 'generated/'
# dataset settings
max_vocab_size: 50000
src_len: 400
tgt_len: 100

# model settngs
decoding_strategy: 'beam_search'
beam_size: 4
is_attention: True
is_pgen: True
is_coverage: True
cov_loss_lambda: 1.0

Log file is located in ./log, more details can be found in yamls.

Note: Distributed Data Parallel (DDP) is not supported yet.

Train & Evaluation

From scratch run `fire.py`.

if __name__ == '__main__':
    config = Config(config_dict={'test_only': False,
                                 'load_experiment': None})
    train(config)

If you want to resume from a checkpoint, just set the 'load_experiment': './saved/$model_name$.pth'. Similarly, when 'test_only' is set to True, 'load_experiment' is required.

Results

The best model is trained on a TITAN Xp GPU (8GB usage).

Training loss

Ablation study

Model	Rouge-1	Rouge-2	Rouge-L
Seq2Seq	22.17	7.20	20.97
Seq2Seq+attn	29.35	12.58	27.38
Seq2Seq+attn+pgen	36.04	15.87	32.92
Seq2Seq+attn+pgen+coverage	39.52	17.85	36.40

Note: The architecture of the Seq2Seq model is based on lstm, I hope I can replace it with transformer in the future.

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run `fire.py`.

Results

Training loss

Ablation study

Owner

wxDai

Localized representation learning from Vision and Text (LoVT)

Deep learning toolbox based on PyTorch for hyperspectral data classification.

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course

Experiments for Operating Systems Lab (ETCS-352)

LIMEcraft: Handcrafted superpixel selectionand inspection for Visual eXplanations

A general-purpose programming language, focused on simplicity, safety and stability.

Implementation of the ivis algorithm as described in the paper Structure-preserving visualisation of high dimensional single-cell datasets.

MLP-Like Vision Permutator for Visual Recognition (PyTorch)

[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

PyTorch implementation for Graph Contrastive Learning with Augmentations

Self-Supervised Image Denoising via Iterative Data Refinement

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Segcache: a memory-efficient and scalable in-memory key-value cache for small objects

Fast, general, and tested differentiable structured prediction in PyTorch

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

Code for "R-GCN: The R Could Stand for Random"

Bayesian Optimization Library for Medical Image Segmentation.

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Tools for computational pathology

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run fire.py.

Results

Training loss

Ablation study

Owner

wxDai

Localized representation learning from Vision and Text (LoVT)

Deep learning toolbox based on PyTorch for hyperspectral data classification.

[ACM MM 2021] Yes, "Attention is All You Need", for Exemplar based Colorization

Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course

Experiments for Operating Systems Lab (ETCS-352)

LIMEcraft: Handcrafted superpixel selectionand inspection for Visual eXplanations

A general-purpose programming language, focused on simplicity, safety and stability.

Implementation of the ivis algorithm as described in the paper Structure-preserving visualisation of high dimensional single-cell datasets.

MLP-Like Vision Permutator for Visual Recognition (PyTorch)

[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

PyTorch implementation for Graph Contrastive Learning with Augmentations

Self-Supervised Image Denoising via Iterative Data Refinement

Repository of the paper Compressing Sensor Data for Remote Assistance of Autonomous Vehicles using Deep Generative Models at ML4AD @ NeurIPS 2021.

Segcache: a memory-efficient and scalable in-memory key-value cache for small objects

Fast, general, and tested differentiable structured prediction in PyTorch

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP

Code for "R-GCN: The R Could Stand for Random"

Bayesian Optimization Library for Medical Image Segmentation.

Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]

Tools for computational pathology

From scratch run `fire.py`.