PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Last update: Dec 14, 2022

Overview

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

This repository contains the official PyTorch implementation of the paper:

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding. Xiao Xu*, Libo Qin*, Kaiji Chen, Guoxing Wu, Linlin Li, Wanxiang Che. AAAI 2022. [Paper(Arxiv)] [Paper]

If you use any source code or datasets included in this toolkit in your work, please cite the following paper. The BibTeX entry is listed below:

...

In the following, we guide you through using this repository step by step.

Workflow

Architecture

Results

Preparation

Our code is based on the following packages:

numpy==1.19.5
tqdm==4.50.2
pytorch==1.7.0
python==3.7.3
cudatoolkit==11.0.3
transformers==4.1.1

We highly recommend using Anaconda to manage your Python environment.
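
As a quick sanity check after setting up the environment, the pinned versions above can be compared against what is actually installed. The following is a minimal sketch, not part of this repository: the names `PINNED`, `installed_version`, and `find_mismatches` are hypothetical, and it assumes each package exposes a `__version__` attribute (the `pytorch` package is imported as `torch`).

```python
import importlib

# Versions pinned in the package list above; pytorch is imported as `torch`.
PINNED = {
    "numpy": "1.19.5",
    "tqdm": "4.50.2",
    "torch": "1.7.0",
    "transformers": "4.1.1",
}


def installed_version(module_name):
    """Return the module's __version__ (without a local build suffix),
    or None if the module cannot be imported or has no __version__."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    version = getattr(module, "__version__", None)
    if version is None:
        return None
    # torch may report builds such as "1.7.0+cu110"; drop the "+..." part.
    return str(version).split("+")[0]


def find_mismatches(pinned, lookup=installed_version):
    """Map each package whose installed version differs from its pin
    to a tuple (expected, found)."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches
```

Calling `find_mismatches(PINNED)` inside the target environment returns an empty dict when every package matches its pin. A package that is not installed is reported with `None` as the found version.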

We downloaded the Chinese pretrained model checkpoints from the following links:

How to Run it

The script train.py acts as the main entry point of the project. You can run the experiments with the following commands.

# LSTM w/o Profile on TITAN Xp
python train.py -g -fs -es -uf -bs 8 -lr 0.0006
# LSTM w/ Profile on TITAN Xp
python train.py -g -fs -es -uf -ui -bs 8 -lr 0.0004
# XLNet w/o Profile on Tesla V100s PCIE 32GB
python train.py -g -fs -es -uf -up -mt XLNet -bs 8 -lr 0.001 -blr 4e-05
# ELECTRA w/ Profile on Tesla V100 PCIE 32GB
python train.py -g -fs -es -uf -up -ui -mt ELECTRA -bs 8 -lr 0.0008 -blr 4e-05
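
To run several of these configurations back to back, the commands can be wrapped in a small launcher. This is only a sketch under stated assumptions: the argument strings are copied verbatim from the commands above, while the experiment labels and the helpers `build_command` and `run_all` are hypothetical and not part of this repository. It also assumes `train.py` is in the current working directory.

```python
import shlex
import subprocess
import sys

# Argument strings copied verbatim from the commands above.
# The dictionary keys are illustrative labels, not names used by train.py.
EXPERIMENTS = {
    "lstm_no_profile": "-g -fs -es -uf -bs 8 -lr 0.0006",
    "lstm_profile": "-g -fs -es -uf -ui -bs 8 -lr 0.0004",
    "xlnet_no_profile": "-g -fs -es -uf -up -mt XLNet -bs 8 -lr 0.001 -blr 4e-05",
    "electra_profile": "-g -fs -es -uf -up -ui -mt ELECTRA -bs 8 -lr 0.0008 -blr 4e-05",
}


def build_command(name, python=sys.executable, script="train.py"):
    """Return the argv list for one experiment."""
    return [python, script] + shlex.split(EXPERIMENTS[name])


def run_all(names=None, dry_run=True):
    """Print each command; execute it as well when dry_run is False.
    Stops at the first failing run because of check=True."""
    commands = [build_command(n) for n in (names or EXPERIMENTS)]
    for cmd in commands:
        print(" ".join(shlex.quote(part) for part in cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
    return commands
```

With the default `dry_run=True`, `run_all()` only prints the four command lines, which is a cheap way to review them before starting a long training job; `run_all(["lstm_profile"], dry_run=False)` would launch a single run.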

If you have any questions, please open an issue on the project or email me or lbqin, and we will reply soon.

Acknowledgement

We are highly grateful for the public code of Stack-Propagation!

A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding. Libo Qin, Wanxiang Che, Yangming Li, Haoyang Wen and Ting Liu. EMNLP 2019. Long paper. [pdf] [code]
We are highly grateful for the open-source knowledge graph!
- CN-DBpedia
- OwnThink

Owner

Xiao Xu
