HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

LSSED: a large-scale dataset and benchmark for speech emotion recognition

Weiquan Fan Xiangmin Xu Xiaofen Xing Weidong Chen Dongyan Huang

LSSED: a large-scale dataset and benchmark for speech emotion recognition

Abstract

Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI). However, current existing small-scale databases have limited the development of related research. In this paper, we present LSSED, a challenging large-scale english speech emotion dataset, which has data collected from 820 subjects to simulate real-world distribution. In addition, we release some pre-trained models based on LSSED, which can not only promote the development of speech emotion recognition, but can also be transferred to related downstream tasks such as mental health analysis where data is extremely difficult to collect. Finally, our experiments show the necessity of large-scale datasets and the effectiveness of pre-trained models. The dateset will be released on https://github.com/tobefans/LSSED.

Code Repositories

tobefans/LSSED
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
speech-emotion-recognition-on-lssedPyResNet
Unweighted Accuracy (UA): 0.429

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
LSSED: a large-scale dataset and benchmark for speech emotion recognition | Papers | HyperAI