Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Last update: Jan 04, 2023

Related tags

Text Data & NLP PABEE

Overview

Patience-based Early Exit

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

NEWS: We now have a better and tidier implementation integrated into Hugging Face transformers!

Citation

If you use this code in your research, please cite our paper:

@inproceedings{zhou2020bert,
 author = {Zhou, Wangchunshu and Xu, Canwen and Ge, Tao and McAuley, Julian and Xu, Ke and Wei, Furu},
 booktitle = {Advances in Neural Information Processing Systems},
 pages = {18330--18341},
 publisher = {Curran Associates, Inc.},
 title = {BERT Loses Patience: Fast and Robust Inference with Early Exit},
 url = {https://proceedings.neurips.cc/paper/2020/file/d4dd111a4fd973394238aca5c05bebe3-Paper.pdf},
 volume = {33},
 year = {2020}
}

Requirement

Our code is built on huggingface/transformers. To use our code, you must clone and install huggingface/transformers.

Training

You can fine-tune a pretrained language model and train the internal classifiers by configuring and running finetune_bert.sh and finetune_albert.sh .

Inference

You can inference with different patience settings by configuring and running patience_infer_albert.sh and patience_infer_bert.sh.

Bug Report and Contribution

If you'd like to contribute and add more tasks (only GLUE is available at this moment), please submit a pull request and contact me. Also, if you find any problem or bug, please report with an issue. Thanks!

Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Related tags

Overview

Patience-based Early Exit

Citation

Requirement

Training

Inference

Bug Report and Contribution

Owner

Kevin Canwen Xu

Analyse japanese ebooks using MeCab to determine the difficulty level for japanese learners

Beyond Paragraphs: NLP for Long Sequences

Topic Modelling for Humans

Model for recasing and repunctuating ASR transcripts

:mag: Transformers at scale for question answering & neural search. Using NLP via a modular Retriever-Reader-Pipeline. Supporting DPR, Elasticsearch, HuggingFace's Modelhub...

Code Generation using a large neural network called GPT-J

Contains links to publicly available datasets for modeling health outcomes using speech and language.

Predicting the usefulness of reviews given the review text and metadata surrounding the reviews.

Code for Text Prior Guided Scene Text Image Super-Resolution

Data manipulation and transformation for audio signal processing, powered by PyTorch

BiNE: Bipartite Network Embedding

To create a deep learning model which can explain the content of an image in the form of speech through caption generation with attention mechanism on Flickr8K dataset.

DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task

Intent parsing and slot filling in PyTorch with seq2seq + attention

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Stanford CoreNLP provides a set of natural language analysis tools written in Java

A minimal code for fairseq vq-wav2vec model inference.

PyJPBoatRace: Python-based Japanese boatrace tools 🚤

This project consists of data analysis and data visualization (done using python)of all IPL seasons from 2008 to 2019 and answering the most asked questions about the IPL.