Command Palette
Search for a command to run...
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results
{Carlos Busso Berrak Sisman Najim Dehak Leibny Paola Garcia Thomas Thebaud Laureano Moro Velazquez Abinay R. Naini Ali N. Salman Lucas Goncalves}
Abstract
The Odyssey 2024 Speech Emotion Recognition (SER) Challenge aims to enhance innovation in recognizing emotions from spontaneous speech, moving beyond traditional datasets derived from acted scenarios. It offers speaker-independent training, development, and an exclusive test set, all annotated for the two tracks explored in this challenge: categorical and attribute SER tasks. This initiative promotes collaboration among researchers to develop SER technologies that perform accurately in real-world settings, encouraging researchers to explore innovative approaches that leverage the latest advancements in audio processing for SER. In this paper, we provide a detailed description of the baseline, leaderboard, evaluation of the results, and a discussion of the key findings. The competition website with leaderboards, links to baseline code, and instructions can be found here: https://lab-msp.com/MSP-Podcast_Competition/leaderboard.php
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| speech-emotion-recognition-on-msp-podcast | wavlm | CCC: 0.6466753 |
| speech-emotion-recognition-on-msp-podcast-1 | wavlm | CCC: 0.7465055 |
| speech-emotion-recognition-on-msp-podcast-2 | wavlm | CCC: 0.6712493 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.