A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Last update: Dec 27, 2022

Related tags

Overview

P-tuning

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

How to use our code

We have released the code and datasets for LAMA and few-shot SuperGLUE (32-dev) experiments. Please check README.md and requirement.txt in the corresponding subdirectories for details.

The LAMA and FewGLUE_32dev datasets are available. The LAMA dataset should be placed in ./data directory, and the SuperGLUE dataset should be placed in the ./ (project root) directory.

Citation

If you find our work useful, please cite the following paper:

@article{liu2021gpt,
  title={GPT Understands, Too}, 
  author={Xiao Liu and Yanan Zheng and Zhengxiao Du and Ming Ding and Yujie Qian and Zhilin Yang and Jie Tang},
  year={2021},
  journal={arXiv preprint arXiv:2103.10385},
  url={https://arxiv.org/abs/2103.10385}
}

Owner

THUDM

Data Mining Research Group at Tsinghua University

GitHub Repository

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

GEP (GDB Enhanced Prompt) GEP (GDB Enhanced Prompt) is a GDB plug-in which make your GDB command prompt more convenient and flexibility. Why I need th

23 Dec 21, 2022

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week

RoBERTa base model for Marathi Language (मराठी भाषा) Pretrained model on Marathi language using a masked language modeling (MLM) objective. RoBERTa wa

23 Oct 19, 2022

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

GP-UNIT - Official PyTorch Implementation This repository provides the official PyTorch implementation for the following paper: Unsupervised Image-to-

125 Jan 03, 2023

This project is a re-implementation of MASTER: Multi-Aspect Non-local Network for Scene Text Recognition by MMOCR

This project is a re-implementation of MASTER: Multi-Aspect Non-local Network for Scene Text Recognition by MMOCR，which is an open-source toolbox based on PyTorch. The overall architecture will be sh

82 Nov 17, 2022

Defending graph neural networks against adversarial attacks (NeurIPS 2020)

GNNGuard: Defending Graph Neural Networks against Adversarial Attacks Authors: Xiang Zhang ( Zitnik Lab @ Harvard 44 Dec 07, 2022

Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation

Tiny-NewsRec The source codes for our paper "Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation". Requirements PyTorch == 1.6.0 Tensor

3 Dec 07, 2022

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution Abstract Within the Latin (and ancient Greek) production, it is well

4 Dec 03, 2022

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

CCOP Code of our paper Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning Requirement Install OpenSelfSup Install Detectron2

21 Dec 13, 2022

A unified 3D Transformer Pipeline for visual synthesis

Overview This is the official repo for the paper: "NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion". NÜWA is a unified multimodal

2.6k Jan 03, 2023

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

AI-Bot 一个基于watermelon改造的OpenAI-GPT-2的智能机器人在Binder上直接运行测试目前有两种实现方式 TF2的GPT-2 TF

9 Nov 16, 2022

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

A selection of State Of The Art research papers (and code) on human trajectory prediction (forecasting). Papers marked with [W] are workshop papers.

40 Nov 18, 2022

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection Main requirements torch = 1.0 torchvision = 0.2.0 Python 3 Environm

15 Apr 04, 2022

"SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang

SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image [Paper] [Website] Pipeline Code Environment pip install -r requirements

250 Jan 05, 2023

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Related tags

Overview

P-tuning

How to use our code

Citation

Owner

THUDM

GEP (GDB Enhanced Prompt) - a GDB plug-in for GDB command prompt with fzf history search, fish-like autosuggestions, auto-completion with floating window, partial string matching in history, and more!

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

This project is a re-implementation of MASTER: Multi-Aspect Non-local Network for Scene Text Recognition by MMOCR

Defending graph neural networks against adversarial attacks (NeurIPS 2020)

Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation

Syllabic Quantity Patterns as Rhythmic Features for Latin Authorship Attribution

Code of our paper "Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning"

A unified 3D Transformer Pipeline for visual synthesis

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

"SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image", Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang

CMSC320 - Introduction to Data Science - Fall 2021

A novel framework to automatically learn high-quality scanning of non-planar, complex anisotropic appearance.

A human-readable PyTorch implementation of "Self-attention Does Not Need O(n^2) Memory"

BADet: Boundary-Aware 3D Object Detection from Point Clouds (Pattern Recognition 2022)

Released code for Objects are Different: Flexible Monocular 3D Object Detection, CVPR21

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Full Resolution Residual Networks for Semantic Image Segmentation