Command Palette
Search for a command to run...
Emile Chapuis Pierre Colombo Matteo Manica Matthieu Labeau Chloe Clavel

Abstract
Sequence labelling tasks like Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learn generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE benchmark (\texttt{SILICONE}). \texttt{SILICONE} is model-agnostic and contains 10 different datasets of various sizes. We obtain our representations with a hierarchical encoder based on transformer architectures, for which we extend two well-known pre-training objectives. Pre-training is performed on OpenSubtitles: a large corpus of spoken dialog containing over $2.3$ billion of tokens. We demonstrate how hierarchical encoders achieve competitive results with consistently fewer parameters compared to state-of-the-art models and we show their importance for both pre-training and fine-tuning.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| dialogue-act-classification-on-icsi-meeting | Pretrained Hierarchical Transformer | Accuracy: 92.4 |
| dialogue-act-classification-on-switchboard | Pretrained Hierarchical Transformer | Accuracy: 79.2 |
| emotion-recognition-in-conversation-on | Pretrained Hierarchical Transformer | Accuracy: 66.05 Weighted-F1: 65.37 |
| emotion-recognition-in-conversation-on-2 | Pretrained Hierarchical Transformer | MAE (Arousal): 0.16 MAE (Expectancy): 0.16 MAE (Power): 7.70 MAE (Valence): 0.16 |
| emotion-recognition-in-conversation-on-3 | Pretrained Hierarchical Transformer | Micro-F1: 60.14 |
| emotion-recognition-in-conversation-on-meld | Pretrained Hierarchical Transformer | Weighted-F1: 61.90 |
| text-classification-on-silicone-benchmark | Pretrained Hierarchical Transformer | 1:1 Accuracy: 71.25 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.