Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Last update: Dec 12, 2022

Related tags

Deep Learning SimCLS

Overview

SimCLS

Code for our paper: "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

1. How to Install

Requirements

python3
conda create --name env --file spec-file.txt
pip3 install -r requirements.txt

Description of Codes

main.py -> training and evaluation procedure
model.py -> models
data_utils.py -> dataloader
utils.py -> utility functions
preprocess.py -> data preprocessing

Workspace

Following directories should be created for our experiments.

./cache -> storing model checkpoints
./result -> storing evaluation results

2. Preprocessing

We use the following datasets for our experiments.

CNN/DailyMail -> https://github.com/abisee/cnn-dailymail
XSum -> https://github.com/EdinburghNLP/XSum

For data preprocessing, please run

python preprocess.py --src_dir [path of the raw data] --tgt_dir [output path] --split [train/val/test] --cand_num [number of candidate summaries]

src_dir should contain the following files (using test split as an example):

test.source
test.source.tokenized
test.target
test.target.tokenized
test.out
test.out.tokenized

Each line of these files should contain a sample. In particular, you should put the candidate summaries for one data sample at neighboring lines in test.out and test.out.tokenized.

The preprocessing precedure will store the processed data as seperate json files in tgt_dir.

We have provided an example file in ./example.

3. How to Run

Hyper-parameter Setting

You may specify the hyper-parameters in main.py.

Train

python main.py --cuda --gpuid [list of gpuid] -l

Fine-tune

python main.py --cuda --gpuid [list of gpuid] -l --model_pt [model path]

Evaluate

python main.py --cuda --gpuid [single gpu] -e --model_pt [model path]

4. Results

CNNDM

	ROUGE-1	ROUGE-2	ROUGE-L
BART	44.39	21.21	41.28
Ours	46.67	22.15	43.54

XSum

	ROUGE-1	ROUGE-2	ROUGE-L
Pegasus	47.10	24.53	39.23
Ours	47.61	24.57	39.44

Our model outputs on these datasets can be found in ./output.

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Related tags

Overview

SimCLS

1. How to Install

Requirements

Description of Codes

Workspace

2. Preprocessing

3. How to Run

Hyper-parameter Setting

Train

Fine-tune

Evaluate

4. Results

CNNDM

XSum

Owner

Yixin Liu

CompilerGym is a library of easy to use and performant reinforcement learning environments for compiler tasks

Official code for UnICORNN (ICML 2021)

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks.

Regulatory Instruments for Fair Personalized Pricing.

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

The repository for freeCodeCamp's YouTube course, Algorithmic Trading in Python

Video lie detector using xgboost - A video lie detector using OpenFace and xgboost

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

Code, final versions, and information on the Sparkfun Graphical Datasheets

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

VLG-Net: Video-Language Graph Matching Networks for Video Grounding

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

GE2340 project source code without credentials.

Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition (AGRA, ACM 2020, Oral)

Ready-to-use code and tutorial notebooks to boost your way into few-shot image classification.

DCSL - Generalizable Crowd Counting via Diverse Context Style Learning

Simulation of Self Driving Car