💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

Last update: Dec 21, 2022

Overview

Perspective-taking and Pragmatics for Generating
Empathetic Responses Focused on Emotion Causes

Official PyTorch implementation and EmoCause evaluation set of our EMNLP 2021 paper 💛
Hyunwoo Kim, Byeongchang Kim, and Gunhee Kim. Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes. EMNLP, 2021 [Paper coming soon!]

TL;DR: In order to express deeper empathy in dialogues, we argue that responses should focus on the cause of emotions. Inspired by perspective-taking of humans, we propose a generative emotion estimator (GEE) which can recognize emotion cause words solely based on sentence-level emotion labels without word-level annotations (i.e., weak-supervision). To evaluate our approach, we annotate emotion cause words and release the EmoCause evaluation set. We also propose a pragmatics-based method for generating responses focused on targeted words from the context.

Reference

If you use the materials in this repository as part of any published research, we ask you to cite the following paper:

@inproceedings{Kim:2021:empathy,
  title={Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes},
  author={Kim, Hyunwoo and Kim, Byeongchang and Kim, Gunhee},
  booktitle={EMNLP},
  year=2021
}

Implementation

System Requirements

Python 3.7.9
Pytorch 1.6.0
CUDA 10.2 supported GPU with at least 24GB memory
See environment.yml for details

Environment setup

Our code is built on the ParlAI framework. We recommend you create a conda environment as follows

conda env create -f environment.yml

and activate it with

conda activate focused-empathy
python -m spacey download en

EmoCause evaluation set for weakly-supervised emotion cause recognition

EmoCause is a dataset of annotated emotion cause words in emotional situations from the EmpatheticDialogues valid and test set. The goal is to recognize emotion cause words in sentences by training only on sentence-level emotion labels without word-level labels (i.e., weakly-supervised emotion cause recognition). EmoCause is based on the fact that humans do not recognize the cause of emotions with supervised learning on word-level cause labels. Thus, we do not provide a training set.

You can download the EmoCause eval set [here].
Note, the dataset will be downloaded automatically when you run the experiment command below.

Data statistics and structure

	#Emotion	Label type	#Label/Utterance	#Utterance
EmoCause	32	Word	2.3	4.6K

{
  "original_situation": the original situations in the EmpatheticDialogues,
  "tokenized_situation": tokenized situation utterances using spacy,
  "emotion": emotion labels,
  "conv_id": id for each corresponding conversation in EmpatheticDialogues,
  "annotation": list of tuples: (emotion cause word, index),
  "labels": list of strings containing the emotion cause words
}

Running Experiments

All corresponding models will be downloaded automatically when running the following commands.
We also provide manual download links: [GEE] [Finetuned Blender]

Weakly-supervised emotion cause word recognition with GEE on EmoCause

You can evaluate our proposed Generative Emotion Estimator (GEE) on the EmoCause eval set.

python eval_emocause.py --model agents.gee_agent:GeeCauseInferenceAgent --fp16 False

Focused empathetic response generation with finetuned Blender on EmpatheticDialogues

You can evaluate our approach for generating focused empathetic responses on top of a finetuned Blender (Not familiar with Blender? See here!).

python eval_empatheticdialogues.py --model agents.empathetic_gee_blender:EmpatheticBlenderAgent --model_file data/models/finetuned_blender90m/model --fp16 False --empathy-score False

Adding the --alpha 0 flag will run the Blender without pragmatics. You can also try the random distractor (Plain S1) by adding --distractor-type random.

💡 To measure the Interpretation and Exploration scores also, set the --empathy-score to True. It will automatically download the RoBERTa models finetuned on EmpatheticDialogues. For more details on empathy scores, visit the original repo.

Acknowledgements

We thank the anonymous reviewers for their helpful comments on this work.

This research was supported by Samsung Research Funding Center of Samsung Electronics under project number SRFCIT210101. The compute resource and human study are supported by Brain Research Program by National Research Foundation of Korea (NRF) (2017M3C7A1047860).

Have any question?

Please contact Hyunwoo Kim at hyunw.kim at vl dot snu dot ac dot kr.

License

This repository is MIT licensed. See the LICENSE file for details.

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

Related tags

Overview

Perspective-taking and Pragmatics for Generating
Empathetic Responses Focused on Emotion Causes

Reference

Implementation

System Requirements

Environment setup

EmoCause evaluation set for weakly-supervised emotion cause recognition

Data statistics and structure

Running Experiments

Weakly-supervised emotion cause word recognition with GEE on EmoCause

Focused empathetic response generation with finetuned Blender on EmpatheticDialogues

Acknowledgements

Have any question?

License

Owner

Hyunwoo Kim

A full spaCy pipeline and models for scientific/biomedical documents.

Mednlp - Medical natural language parsing and utility library

Local cross-platform machine translation GUI, based on CTranslate2

MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

DiY Oxygen Concentrator based on the OxiKit

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Question and answer retrieval in Turkish with BERT

Model for recasing and repunctuating ASR transcripts

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

Repository for the paper: VoiceMe: Personalized voice generation in TTS

A CSRankings-like index for speech researchers

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Bu Chatbot, Konya Bilim Merkezi Yen için tasarlanmış olan bir projedir.

This is a MD5 password/passphrase brute force tool

Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks

KoBERTopic은 BERTopic을 한국어 데이터에 적용할 수 있도록 토크나이저와 BERT를 수정한 코드입니다.

💛 Code and Dataset for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"

Related tags

Overview

Perspective-taking and Pragmatics for GeneratingEmpathetic Responses Focused on Emotion Causes

Reference

Implementation

System Requirements

Environment setup

EmoCause evaluation set for weakly-supervised emotion cause recognition

Data statistics and structure

Running Experiments

Weakly-supervised emotion cause word recognition with GEE on EmoCause

Focused empathetic response generation with finetuned Blender on EmpatheticDialogues

Acknowledgements

Have any question?

License

Owner

Hyunwoo Kim

A full spaCy pipeline and models for scientific/biomedical documents.

Mednlp - Medical natural language parsing and utility library

Local cross-platform machine translation GUI, based on CTranslate2

MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

DiY Oxygen Concentrator based on the OxiKit

Deeply Supervised, Layer-wise Prediction-aware (DSLP) Transformer for Non-autoregressive Neural Machine Translation

Question and answer retrieval in Turkish with BERT

Model for recasing and repunctuating ASR transcripts

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

Repository for the paper: VoiceMe: Personalized voice generation in TTS

A CSRankings-like index for speech researchers

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Bu Chatbot, Konya Bilim Merkezi Yen için tasarlanmış olan bir projedir.

This is a MD5 password/passphrase brute force tool

Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks

KoBERTopic은 BERTopic을 한국어 데이터에 적용할 수 있도록 토크나이저와 BERT를 수정한 코드입니다.

Perspective-taking and Pragmatics for Generating
Empathetic Responses Focused on Emotion Causes

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。