Command Palette
Search for a command to run...
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System
Xiao Yang ; Das Rohan Kumar

Abstract
This work aims to advance sound event detection (SED) research by presentinga new large language model (LLM)-powered dataset namely wild domesticenvironment sound event detection (WildDESED). It is crafted as an extension tothe original DESED dataset to reflect diverse acoustic variability and complexnoises in home settings. We leveraged LLMs to generate eight different domesticscenarios based on target sound categories of the DESED dataset. Then weenriched the scenarios with a carefully tailored mixture of noises selectedfrom AudioSet and ensured no overlap with target sound. We consider widelypopular convolutional neural recurrent network to study WildDESED dataset,which depicts its challenging nature. We then apply curriculum learning bygradually increasing noise complexity to enhance the model's generalizationcapabilities across various noise levels. Our results with this approach showimprovements within the noisy environment, validating the effectiveness on theWildDESED dataset promoting noise-robust SED advancements.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| sound-event-detection-on-wilddesed | CRNN (WildDESED) | PSDS1 (-5dB): 0.048 PSDS1 (0dB): 0.087 PSDS1 (10dB): 0.175 PSDS1 (5dB): 0.135 PSDS1 (Clean): 0.200 |
| sound-event-detection-on-wilddesed | CRNN | PSDS1 (-5dB): 0.017 PSDS1 (0dB): 0.064 PSDS1 (10dB): 0.222 PSDS1 (5dB): 0.148 PSDS1 (Clean): 0.348 |
| sound-event-detection-on-wilddesed | CRNN (WildDESED + Curriculrm learning) | PSDS1 (-5dB): 0.049 PSDS1 (0dB): 0.114 PSDS1 (10dB): 0.212 PSDS1 (5dB): 0.175 PSDS1 (Clean): 0.265 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.