Source code for our paper "Empathetic Response Generation with State Management"

Last update: Oct 08, 2022

Overview

Source code for our paper "Empathetic Response Generation with State Management"

this repository is maintained by both Jun Gao and Yuhan Liu

Model Overview

Environment Requirement

pytorch >= 1.4
sklearn
nltk
numpy
bert-score

Dataset

you can directly use the processed dataset located in data/empathetic:

├── data
│   ├── empathetic
│   │   ├── parsed_emotion_Ekman_intent_test.json
│   │   ├── parsed_emotion_Ekman_intent_train.json
│   │   ├── parsed_emotion_Ekman_intent_valid.json
│   │   ├── emotion_intent_trans.mat
│   │   ├── goEmotion_emotion_trans.mat

Or you want to reproduce the data annotated with goEmotion emotion classifier and empathetic intent classifier, you can run the command:

convert raw csv empathetic dialogue data into json format. (origin dataset link: EmpatheticDialogues)
```
bash preprocess_raw.sh
```
train emotion classfier with goEmotion dataset and annotate (origin dataset link: goEmotion). Here $BERT_DIR is your pretrained BERT model directory which includes vocab.txt, config.json and pytorch_model.bin, here we simply use bert-base-en from Hugginface
```
bash ./bash/emotion_annotate.sh  $BERT_DIR 32 0.00005 16 3 1024 2 0.1
```
train intent classfier with empathetic intent dataset and annotate (origin dataset link: Empathetic_Intent)
```
bash ./bash/intent_annotate.sh  $BERT_DIR 32 0.00005 16 3 1024 2 0.1
```
build prior emotion-emotion and emotion-intent transition matrix
```
bash ./bash/build_transition_mat.sh
```

Train

For training the LM-based model, you need to download bert-base-en and gpt2-small from Hugginface first, then run the following command. Here $GPT_DIR and $BERT_DIR are the downloaded model directory:

bash ./bash/train_LM.sh --gpt_path $GPT_DIR --bert_path $BERT_DIR --gpu_id 2 --epoch 5 --lr_NLU 0.00003 --lr_NLG 0.00008 --bsz_NLU 16 --bsz_NLG 16

for example:

bash ./bash/train_LM.sh --gpt_path /home/liuyuhan/datasets/gpt2-small --bert_path /home/liuyuhan/datasets/bert-base-en bert-base-en --gpu_id 2 --epoch 5 --lr_NLU 0.00003 --lr_NLG 0.00008 --bsz_NLU 16 --bsz_NLG 16

For training the Trs-based model, we use glove.6B.300d as the pretrained word embeddings. You can run the following command to train model. Here $GLOVE is the glove embedding txt file.

bash ./bash/train_Trs.sh --gpu_id 2 --epoch 15 --lr_NLU 0.00007 --lr_NLG 0.0015 --bsz_NLU 16 --bsz_NLG 16 --glove $GLOVE

for example:

bash ./bash/train_Trs.sh --gpu_id 2 --epoch 15 --lr_NLU 0.00007 --lr_NLG 0.0015 --bsz_NLU 16 --bsz_NLG 16 --glove /home/liuyuhan/datasets/glove/glove.6B.300d.txt

Evaluate

To generate the automatic metric results, firstly you need to make sure that bert-score is successfully installed. In our paper, we use roberta-large-en rescaled with baseline to calculate BERTScore. You can download roberta-large-en from Hugginface. For the rescaled_baseline file, we can download it from here and put it under the roberta-large-en model directory.

Then you can run the following command to get the result, here $hypothesis and $reference are the generated response file and ground-truth response file. $result is the output result file. $ROBERTA_DIR is the downloaded roberta-large-en model directory.

To evaluate LM-based model, the command is:

bash ./bash/eval.sh --hyp $hypothesis --ref ./data/empathetic/ref.txt --out $result --bert $ROBERTA_DIR --gpu_id 0 --mode LM

To evaluate Trs-based model, the command is:

bash ./bash/eval.sh --hyp $hypothesis --ref ./data/empathetic/ref_tokenize.txt --out $result --bert $ROBERTA_DIR --gpu_id 0 --mode Trs

Source code for our paper "Empathetic Response Generation with State Management"

Related tags

Overview

Source code for our paper "Empathetic Response Generation with State Management"

Model Overview

Environment Requirement

Dataset

Train

Evaluate

Owner

Yuhan Liu

A geometric deep learning pipeline for predicting protein interface contacts.

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

Kohei's 5th place solution for xview3 challenge

Neural Network to colorize grayscale images

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

"Learning and Analyzing Generation Order for Undirected Sequence Models" in Findings of EMNLP, 2021

wgan, wgan2(improved, gp), infogan, and dcgan implementation in lasagne, keras, pytorch

Draw like Bob Ross using the power of Neural Networks (With PyTorch)!

Code examples and benchmarks from the paper "Understanding Entropy Coding With Asymmetric Numeral Systems (ANS): a Statistician's Perspective"

Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.

Facial recognition project

PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

Emulation and Feedback Fuzzing of Firmware with Memory Sanitization

This repository contains implementations and illustrative code to accompany DeepMind publications

code for "Self-supervised edge features for improved Graph Neural Network training",

Python scripts to detect faces in Python with the BlazeFace Tensorflow Lite models

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

Code for 1st place solution in Sleep AI Challenge SNU Hospital

Display, filter and search log messages in your terminal

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations