A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Last update: Dec 27, 2022

Related tags

Overview

P-tuning

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

How to use our code

We have released the code and datasets for LAMA and few-shot SuperGLUE (32-dev) experiments. Please check README.md and requirement.txt in the corresponding subdirectories for details.

The LAMA and FewGLUE_32dev datasets are available. The LAMA dataset should be placed in ./data directory, and the SuperGLUE dataset should be placed in the ./ (project root) directory.

Citation

If you find our work useful, please cite the following paper:

@article{liu2021gpt,
  title={GPT Understands, Too}, 
  author={Xiao Liu and Yanan Zheng and Zhengxiao Du and Ming Ding and Yujie Qian and Zhilin Yang and Jie Tang},
  year={2021},
  journal={arXiv preprint arXiv:2103.10385},
  url={https://arxiv.org/abs/2103.10385}
}

Owner

THUDM

Data Mining Research Group at Tsinghua University

GitHub Repository

It is an open dataset for object detection in remote sensing images.

RSOD-Dataset It is an open dataset for object detection in remote sensing images. The dataset includes aircraft, oiltank, playground and overpass. The

136 Dec 08, 2022

A booklet on machine learning systems design with exercises

Machine Learning Systems Design Read this booklet here. This booklet covers four main steps of designing a machine learning system: Project setup Data

7.6k Jan 08, 2023

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Denoised-Smoothing-TF Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow. Denoised Smoothing is

19 Dec 11, 2022

Find the Heart simple Python Game

This is a simple Python game for finding a heart emoji. There is a 3 x 3 matrix in which a heart emoji resides. The location of the heart is randomized and is not revealed. The player must guess the

1 Jan 24, 2022

Siamese TabNet

Raifhack-DS-2021 https://raifhack.ru/ - Команда Звёздочка Siamese TabNet Сиамская TabNet предсказывает стоимость объекта недвижимости с price_type=1,

15 Apr 16, 2022

Combine Tacotron2 and Hifi GAN to generate speech from text

EndToEndTextToSpeech Combine Tacotron2 and Hifi GAN to generate speech from text Download weights Hifi GAN - hifi_gan/checkpoint/ : pretrain 2.5M ste

1 Dec 18, 2021

Optimal space decomposition based-product quantization for approximate nearest neighbor search

Optimal space decomposition based-product quantization for approximate nearest neighbor search Abstract Product quantization(PQ) is an effective neare

1 Nov 19, 2021

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Multi-label Classification with Partial Annotations using Class-aware Selective Loss Paper | Pretrained models Official PyTorch Implementation Emanuel

99 Dec 27, 2022

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate. Website • Key Features • How To Use • Docs •

21.1k Jan 08, 2023

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"

Sensory Spaces of English Perceptual Verbs This repository contains the code and collocational data described in the paper "Exploring the Sensory Spac

0 Sep 07, 2021

A flexible and extensible framework for gait recognition.

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

335 Dec 22, 2022

Jupyter notebooks for using & learning Keras

deep-learning-with-keras-notebooks 這個github的repository主要是個人在學習Keras的一些記錄及練習。希望在學習過程中發現到一些好的資訊與範例也可以對想要學習使用 Keras來解決問題的同好，或是對深度學習有興趣的在學學生可以有一些方便理解與上手範例

2.1k Dec 27, 2022

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Stock Price Prediction of Apple Inc. Using Recurrent Neural Network OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network Dataset:

410 Jan 05, 2023

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

HAIS Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021) by Shaoyu Chen, Jiemin Fang, Qian Zhang, Wenyu Liu, Xinggang Wang*. (*) Corresp

145 Jan 05, 2023

source code of “Visual Saliency Transformer” (ICCV2021)

Visual Saliency Transformer (VST) source code for our ICCV 2021 paper “Visual Saliency Transformer” by Nian Liu, Ni Zhang, Kaiyuan Wan, Junwei Han, an

89 Dec 21, 2022

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation Authors: Ge-Peng Ji*, Yu-Cheng Chou*, Deng-Ping Fan, Geng Che

85 Dec 30, 2022

XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale

XtremeDistilTransformers for Distilling Massive Multilingual Neural Networks ACL 2020 Microsoft Research [Paper] [Video] Releasing [XtremeDistilTransf

125 Jan 04, 2023

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Related tags

Overview

P-tuning

How to use our code

Citation

Owner

THUDM

It is an open dataset for object detection in remote sensing images.

A booklet on machine learning systems design with exercises

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Find the Heart simple Python Game

Siamese TabNet

Combine Tacotron2 and Hifi GAN to generate speech from text

Optimal space decomposition based-product quantization for approximate nearest neighbor search

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"

A flexible and extensible framework for gait recognition.

Jupyter notebooks for using & learning Keras

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Hierarchical Aggregation for 3D Instance Segmentation (ICCV 2021)

source code of “Visual Saliency Transformer” (ICCV2021)

2021-MICCAI-Progressively Normalized Self-Attention Network for Video Polyp Segmentation

XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale

PyContinual (An Easy and Extendible Framework for Continual Learning)

Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

Official repository for GCR rerank, a GCN-based reranking method for both image and video re-ID