HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System

Xiao Yang ; Das Rohan Kumar

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound
  Event Detection System

Abstract

This work aims to advance sound event detection (SED) research by presentinga new large language model (LLM)-powered dataset namely wild domesticenvironment sound event detection (WildDESED). It is crafted as an extension tothe original DESED dataset to reflect diverse acoustic variability and complexnoises in home settings. We leveraged LLMs to generate eight different domesticscenarios based on target sound categories of the DESED dataset. Then weenriched the scenarios with a carefully tailored mixture of noises selectedfrom AudioSet and ensured no overlap with target sound. We consider widelypopular convolutional neural recurrent network to study WildDESED dataset,which depicts its challenging nature. We then apply curriculum learning bygradually increasing noise complexity to enhance the model's generalizationcapabilities across various noise levels. Our results with this approach showimprovements within the noisy environment, validating the effectiveness on theWildDESED dataset promoting noise-robust SED advancements.

Code Repositories

swagshaw/wilddesed
Official
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
sound-event-detection-on-wilddesedCRNN (WildDESED)
PSDS1 (-5dB): 0.048
PSDS1 (0dB): 0.087
PSDS1 (10dB): 0.175
PSDS1 (5dB): 0.135
PSDS1 (Clean): 0.200
sound-event-detection-on-wilddesedCRNN
PSDS1 (-5dB): 0.017
PSDS1 (0dB): 0.064
PSDS1 (10dB): 0.222
PSDS1 (5dB): 0.148
PSDS1 (Clean): 0.348
sound-event-detection-on-wilddesedCRNN (WildDESED + Curriculrm learning)
PSDS1 (-5dB): 0.049
PSDS1 (0dB): 0.114
PSDS1 (10dB): 0.212
PSDS1 (5dB): 0.175
PSDS1 (Clean): 0.265

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System | Papers | HyperAI