HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Sudo rm -rf: Efficient Networks for Universal Audio Source Separation

Efthymios Tzinis Zhepei Wang Paris Smaragdis

Sudo rm -rf: Efficient Networks for Universal Audio Source Separation

Abstract

In this paper, we present an efficient neural network for end-to-end general purpose audio source separation. Specifically, the backbone structure of this convolutional network is the SUccessive DOwnsampling and Resampling of Multi-Resolution Features (SuDoRMRF) as well as their aggregation which is performed through simple one-dimensional convolutions. In this way, we are able to obtain high quality audio source separation with limited number of floating point operations, memory requirements, number of parameters and latency. Our experiments on both speech and environmental sound separation datasets show that SuDoRMRF performs comparably and even surpasses various state-of-the-art approaches with significantly higher computational resource requirements.

Code Repositories

mpariente/asteroid
pytorch
Mentioned in GitHub
etzinis/sudo_rm_rf
Official
pytorch
Mentioned in GitHub
udase-chime2023/baseline
pytorch
Mentioned in GitHub
etzinis/unsup_speech_enh_adaptation
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speech-separation-on-whamrSudo rm -rf (U=16)
SI-SDRi: 12.1
speech-separation-on-wsj0-2mixSudo rm -rf XL
SI-SDRi: 18.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Sudo rm -rf: Efficient Networks for Universal Audio Source Separation | Papers | HyperAI