HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Nian Shao Erfan Loweimi Xiaofei Li

RCT: Random Consistency Training for Semi-supervised Sound Event Detection

Abstract

Sound event detection (SED), as a core module of acoustic environmental analysis, suffers from the problem of data deficiency. The integration of semi-supervised learning (SSL) largely mitigates such problem while bringing no extra annotation budget. This paper researches on several core modules of SSL, and introduces a random consistency training (RCT) strategy. First, a self-consistency loss is proposed to fuse with the teacher-student model to stabilize the training. Second, a hard mixup data augmentation is proposed to account for the additive property of sounds. Third, a random augmentation scheme is applied to flexibly combine different types of data augmentations. Experiments show that the proposed strategy outperform other widely-used strategies.

Code Repositories

Audio-WestlakeU/RCT-Random-Consistency-Training
Official
pytorch
Mentioned in GitHub
audio-westlakeu/rct
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
sound-event-detection-on-desedRCT
PSDS1: 0.4395
PSDS2: 0.6711
event-based F1 score: 49.62

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
RCT: Random Consistency Training for Semi-supervised Sound Event Detection | Papers | HyperAI