Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

Last update: Oct 19, 2021

Overview

Source code for "UCD participation in TREC-IS 2020A, 2020B and 2021A".

*** update at: 2021/05/25

This repo so far relates to the following work:

Transformer-based Multi-task Learning for Disaster Tweet Categorisation, (WiP paper, ISCRAM 2021)
Multi-task transfer learning for finding actionable information from crisis-related messages on social media, (paper, TREC 2020)

Setup

git clone https://github.com/wangcongcong123/crisis-mtl.git
pip install -r requirements.txt

Dataset preparation

Download the splits prepared for the system from here that contains three subdirectories for 2020a, 2020b and 2021a respectively.
Unzip the file to data/.

Training and submitting

# for 2020a
python run.py --dataset_name 2020a --model_name bert-base-uncased

# or for 2020b
python run.py --edition 2020b --model_name bert-base-uncased
python run.py --edition 2020b --model_name google/electra-base-discriminator
python run.py --edition 2020b --model_name microsoft/deberta-base
python run.py --edition 2020b --model_name distilbert-base-uncased
python submit_ensemble.py --edition 2020b


# or for 2021a
python run.py --edition 2021a --model_name bert-base-uncased
python run.py --edition 2021a --model_name google/electra-base-discriminator
python run.py --edition 2021a --model_name microsoft/deberta-base
python run.py --edition 2021a --model_name distilbert-base-uncased
python submit_ensemble.py --edition 2021a

To see our results compared to other participating runs in 2020a and 2020b, check the appendix of this overview paper. To know the details of our approach, check this ISCRAM-2021 paper on 2020a and this TREC-2020 paper on 2020b. The evaluation for 2021a is still in process so the results will be added as soon as they come out.

Citation

If you use the code in your research, please consider citing the following papers:

@article{wang2021,
author = {Wang, Congcong and Nulty, Paul and Lillis, David},
journal = {Proceedings of the International ISCRAM Conference},
keywords = {18th International Conference on Information Systems for Crisis Response and Management (ISCRAM 2021)},
number = {May},
title = {{Transformer-based Multi-task Learning for Disaster Tweet Categorisation}},
volume = {2021-May},
year = {2021}
}

@inproceedings{congcong2020multi,
 address = {Gaithersburg, MD},
 title = {Multi-task transfer learning for finding actionable information from crisis-related messages on social media},
 booktitle = {Proceedings of the Twenty-Nineth {{Text REtrieval Conference}} ({{TREC}} 2020)},
 author = {Wang, Congcong and Lillis, David},
 year = {2020},
}

Queries

Let me know if any questions via [email protected] or through creating an issue.

Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

Related tags

Overview

Source code for "UCD participation in TREC-IS 2020A, 2020B and 2021A".

Setup

Dataset preparation

Training and submitting

Citation

Queries

Owner

Congcong Wang

A method to perform unsupervised cross-region adaptation of crop classifiers trained with satellite image time series.

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

FastCover: A Self-Supervised Learning Framework for Multi-Hop Influence Maximization in Social Networks by Anonymous.

FAMIE is a comprehensive and efficient active learning (AL) toolkit for multilingual information extraction (IE)

Dilated RNNs in pytorch

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Training a deep learning model on the noisy CIFAR dataset

Few-Shot Graph Learning for Molecular Property Prediction

CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

ByteTrack with ReID module following the paradigm of FairMOT, tracking strategy is borrowed from FairMOT/JDE.

Companion code for the paper "An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence" (NeurIPS 2021)

Example Of Fine-Tuning BERT For Named-Entity Recognition Task And Preparing For Cloud Deployment Using Flask, React, And Docker

All supplementary material used by me while TA-ing CS3244: Machine Learning

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

This is the implementation of the paper LiST: Lite Self-training Makes Efficient Few-shot Learners.

Unofficial Tensorflow 2 implementation of the paper Implicit Neural Representations with Periodic Activation Functions

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

LibFewShot: A Comprehensive Library for Few-shot Learning.

Nest - A flexible tool for building and sharing deep learning modules