Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Related tags

Overview

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right" We provide scripts for downloading/processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility, as library versions and GPUs may cause small differences, but these should be extremely minor.

Dependencies

We use python3 and pytorch 1.7.0, but we do not use cutting-edge features from either and expect to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access here. Once you have access you will need to point the score.py to your API key with the --key argument or put your key in api.key which is the default path.

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processers for datasets where possible in data_downloaders/ but see DATA_README for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py 
   
     --model

where is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. is the name of either a GPT-2 or GPT-3 model e.g. xl, davinci, etc. To speed things up you can use a larger --batch if you have enough GPU memory.

Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Related tags

Overview

Surface Form Competition

Dependencies

OpenAI Beta

Downloading Datasets

Running Scorers

Owner

Peter West

MaskTrackRCNN for video instance segmentation based on mmdetection

Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”

This is a demo app to be used in the video streaming applications

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

KSAI Lite is a deep learning inference framework of kingsoft, based on tensorflow lite

Software Platform for solving and manipulating multiparametric programs in Python

Final project for machine learning (CSC 590). Detection of hepatitis C and progression through blood samples.

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

[ICCV 2021] Group-aware Contrastive Regression for Action Quality Assessment

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

An exploration of log domain "alternative floating point" for hardware ML/AI accelerators.

Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Code to reproduce the results for Compositional Attention

Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://arxiv.org/abs/2103.06332).

Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

AlphaBot2 Pi Core software for interfacing with the various components.

Simulation of Self Driving Car

Generic image compressor for machine learning. Pytorch code for our paper "Lossy compression for lossless prediction".

Code for the head detector (HeadHunter) proposed in our CVPR 2021 paper Tracking Pedestrian Heads in Dense Crowd.