HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

Shutong Feng Nurul Lubis Christian Geishauser Hsien-chin Lin Michael Heck Carel van Niekerk Milica Gašić

EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems

Abstract

The ability to recognise emotions lends a conversational artificial intelligence a human touch. While emotions in chit-chat dialogues have received substantial attention, emotions in task-oriented dialogues remain largely unaddressed. This is despite emotions and dialogue success having equally important roles in a natural system. Existing emotion-annotated task-oriented corpora are limited in size, label richness, and public availability, creating a bottleneck for downstream tasks. To lay a foundation for studies on emotions in task-oriented dialogues, we introduce EmoWOZ, a large-scale manually emotion-annotated corpus of task-oriented dialogues. EmoWOZ is based on MultiWOZ, a multi-domain task-oriented dialogue dataset. It contains more than 11K dialogues with more than 83K emotion annotations of user utterances. In addition to Wizard-of-Oz dialogues from MultiWOZ, we collect human-machine dialogues within the same set of domains to sufficiently cover the space of various emotions that can happen during the lifetime of a data-driven dialogue system. To the best of our knowledge, this is the first large-scale open-source corpus of its kind. We propose a novel emotion labelling scheme, which is tailored to task-oriented dialogues. We report a set of experimental results to show the usability of this corpus for emotion recognition and state tracking in task-oriented dialogues.

Benchmarks

BenchmarkMethodologyMetrics
emotion-recognition-in-conversation-on-emowozDialogueRNN-GloVe
Macro F1: 46.33
Macro F1 (w/o Neutral): 40.14
Weighted F1: 80.76
Weighted F1 (w/o Neutral): 74.56
emotion-recognition-in-conversation-on-emowozDialogueRNN-BERT
Macro F1: 57.10
Macro F1 (w/o Neutral): 52.15
Weighted F1: 83.41
Weighted F1 (w/o Neutral): 75.50
emotion-recognition-in-conversation-on-emowozContextBERT
Macro F1: 59.79
Macro F1 (w/o Neutral): 54.30
Weighted F1: 88.33
Weighted F1 (w/o Neutral): 79.67
emotion-recognition-in-conversation-on-emowozCOSMIC
Macro F1: 61.12
Macro F1 (w/o Neutral): 56.34
Weighted F1: 85.94
Weighted F1 (w/o Neutral): 77.09
emotion-recognition-in-conversation-on-emowozBERT
Macro F1: 55.80
Macro F1 (w/o Neutral): 50.14
Weighted F1: 84.83
Weighted F1 (w/o Neutral): 73.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
EmoWOZ: A Large-Scale Corpus and Labelling Scheme for Emotion Recognition in Task-Oriented Dialogue Systems | Papers | HyperAI