HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

Emile Chapuis Pierre Colombo Matteo Manica Matthieu Labeau Chloe Clavel

Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

Abstract

Sequence labelling tasks like Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learn generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE benchmark (\texttt{SILICONE}). \texttt{SILICONE} is model-agnostic and contains 10 different datasets of various sizes. We obtain our representations with a hierarchical encoder based on transformer architectures, for which we extend two well-known pre-training objectives. Pre-training is performed on OpenSubtitles: a large corpus of spoken dialog containing over $2.3$ billion of tokens. We demonstrate how hierarchical encoders achieve competitive results with consistently fewer parameters compared to state-of-the-art models and we show their importance for both pre-training and fine-tuning.

Benchmarks

BenchmarkMethodologyMetrics
dialogue-act-classification-on-icsi-meetingPretrained Hierarchical Transformer
Accuracy: 92.4
dialogue-act-classification-on-switchboardPretrained Hierarchical Transformer
Accuracy: 79.2
emotion-recognition-in-conversation-onPretrained Hierarchical Transformer
Accuracy: 66.05
Weighted-F1: 65.37
emotion-recognition-in-conversation-on-2Pretrained Hierarchical Transformer
MAE (Arousal): 0.16
MAE (Expectancy): 0.16
MAE (Power): 7.70
MAE (Valence): 0.16
emotion-recognition-in-conversation-on-3Pretrained Hierarchical Transformer
Micro-F1: 60.14
emotion-recognition-in-conversation-on-meldPretrained Hierarchical Transformer
Weighted-F1: 61.90
text-classification-on-silicone-benchmarkPretrained Hierarchical Transformer
1:1 Accuracy: 71.25

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog | Papers | HyperAI