This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Last update: Dec 11, 2022

Related tags

Deep Learning prompt_semantics

Overview

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Usage

To replicate our results in Section 4, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec4/ \
    --prompt-path ../data/binary_NLI_prompts.csv \
    --experiment-name sec4 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

Add --fully-train if you want to train on the entire training set in addition to few-shot settings.

To replicate Section 5, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec5/ \
    --prompt-path ../data/binary_NLI_prompts_permuted.csv \
    --experiment-name sec5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

To get a fine-tuning baseline (Figure 1):

python3 fine_tune.py \
    --save-dir ../runs/fine_tune/ \
    --epochs 5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --fully-train \
    --production \
    --seeds 1

To replicate our exact results, use --seeds 1,2,3,4,5,6,7,8, which yields starting_example_index of 550,231,974,966,1046,2350,1326,928 respectively. This is important for ensuring that all models trained under the same seed always see exactly the same training examples. See paper Section 3 for more details.

If these seeds do not generate the same starting_example_index for you (which you can check in the output CSV files), you will have to manually specify the few-shot subset of training examples. I plan to add an argparse argument for this to make it easy.

All other hyperparameters are the same as the argparse default.

Miscellaneous Notes

You might notice that the code and output files are set up to produce a fine-grained analysis of HANS (McCoy et al., 2019). We actually run all of our main experiments on HANS as well and got similar results, which we plan to write up in a future version of our paper. Meanwhile, if you’re curious, feel free to add --do-diagnosis which will report the results on HANS.

Requirements

Python 3.9.

3.7 should mostly work too. You’d have to just replace the new built-in type hints and dictionary union operators with their older equivalents.

Activate your preferred virtual envrionment and then run pip install -r requirements.txt. If you want to replicate our exact results, use

torch==1.9.0+cu111
transformers==4.9.2
datasets==1.11.0

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Related tags

Overview

Usage

Miscellaneous Notes

Requirements

Owner

Albert Webson

Libraries, tools and tasks created and used at DeepMind Robotics.

Official implementation of Monocular Quasi-Dense 3D Object Tracking

modelvshuman is a Python library to benchmark the gap between human and machine vision

【steal piano】GitHub偷情分析工具！

piSTAR Lab is a modular platform built to make AI experimentation accessible and fun. (pistar.ai)

The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"

MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.

2D Time independent Schrodinger equation solver for arbitrary shape of well

Codes for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"

Eye-Blink-Counter - Python based Computer Vision project which counts how many time a person blinks

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

PyTorch implementation of Super SloMo by Jiang et al.

A symbolic-model-guided fuzzer for TLS

A Java implementation of the experiments for the paper "k-Center Clustering with Outliers in Sliding Windows"

Fast Learning of MNL Model From General Partial Rankings with Application to Network Formation Modeling

Tensorflow Implementation of Pixel Transposed Convolutional Networks (PixelTCN and PixelTCL)

For IBM Quantum Challenge 2021 (May 20 - 26)

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Official implementation of the paper 'Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution' in CVPR 2022

Unsupervised Image Generation with Infinite Generative Adversarial Networks