An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

Last update: Oct 21, 2022

Related tags

Overview

pl_prompt_sst

An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SST2 sentiment analysis dataset. Leveraging the pytorch-lightning features like logging, gradient accumulation and early stopping, etc. Can be used as a template for further development.

Run

Install requirement

pip install -r requirements.txt

Setup the prompt to use in sst2/prompt_config.json

{
    "template_text": "{\"placeholder\": \"text_a\"} In summary, the film was {\"mask\"}.",
    "label_words": [["bad"], ["good"]]
}

Adjust the arguments in run.sh or the code below for your need, and run it.

CUDA_VISIBLE_DEVICES=0 python -u main.py --input_dir ./sst2 \
                                         --prompt_config_dir ./sst2/prompt_config.json \
                                         --model_class bert \
                                         --model_name_or_path prajjwal1/bert-tiny \
                                         --lr 2e-4
                                         --bs 32 \
                                         --max_seq_length 64 \
                                         --patience 4 \
                                         --accumulation 2 \
                                         --seed 666

In my preliminary experiment with the settings above, the model achieve 0.822 F1 compared to 0.820 without prompt.

Note

Can only be executed after this fix on state_dict()

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

Related tags

Overview

pl_prompt_sst

Run

Note

Owner

Zhiling Zhang

Machine learning models from Singapore's NLP research community

PyTranslator é simultaneamente um editor e tradutor de texto com diversos recursos e interface feito com coração e 100% em Python

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Download videos from YouTube/Twitch/Twitter right in the Windows Explorer, without installing any shady shareware apps

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Deploying a Text Summarization NLP use case on Docker Container Utilizing Nvidia GPU

MASS: Masked Sequence to Sequence Pre-training for Language Generation

An implementation of WaveNet with fast generation

Binary LSTM model for text classification

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

FireFlyer Record file format, writer and reader for DL training samples.

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

spaCy-wrap: For Wrapping fine-tuned transformers in spaCy pipelines

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

ThinkTwice: A Two-Stage Method for Long-Text Machine Reading Comprehension

This code is the implementation of Text Emotion Recognition (TER) with linguistic features

The source code of HeCo

Curso práctico: NLP de cero a cien 🤗

This is the offline-training-pipeline for our project.

A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.