HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Music Source Separation with Band-split RNN

Yi Luo Jianwei Yu

Music Source Separation with Band-split RNN

Abstract

The performance of music source separation (MSS) models has been greatly improved in recent years thanks to the development of novel neural network architectures and training pipelines. However, recent model designs for MSS were mainly motivated by other audio processing tasks or other research fields, while the intrinsic characteristics and patterns of the music signals were not fully discovered. In this paper, we propose band-split RNN (BSRNN), a frequency-domain model that explictly splits the spectrogram of the mixture into subbands and perform interleaved band-level and sequence-level modeling. The choices of the bandwidths of the subbands can be determined by a priori knowledge or expert knowledge on the characteristics of the target source in order to optimize the performance on a certain type of target musical instrument. To better make use of unlabeled data, we also describe a semi-supervised model finetuning pipeline that can further improve the performance of the model. Experiment results show that BSRNN trained only on MUSDB18-HQ dataset significantly outperforms several top-ranking models in Music Demixing (MDX) Challenge 2021, and the semi-supervised finetuning stage further improves the performance on all four instrument tracks.

Code Repositories

amanteur/BandSplitRNN-Pytorch
pytorch
Mentioned in GitHub
naba89/iseparate-sdx
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
music-source-separation-on-musdb18Band-Split RNN (semi-sup.)
SDR (avg): 8.97
SDR (bass): 8.16
SDR (drums): 10.15
SDR (other): 7.08
SDR (vocals): 10.47
music-source-separation-on-musdb18Band-Split RNN
SDR (avg): 8.23
SDR (bass): 7.51
SDR (drums): 8.58
SDR (other): 6.62
SDR (vocals): 10.21
music-source-separation-on-musdb18-hqBand-Split RNN (semi-sup.)
SDR (avg): 8.97
SDR (bass): 8.16
SDR (drums): 10.15
SDR (others): 7.08
SDR (vocals): 10.47
music-source-separation-on-musdb18-hqBand-Split RNN
SDR (avg): 8.24
SDR (bass): 7.22
SDR (drums): 9.01
SDR (others): 6.70
SDR (vocals): 10.01

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Music Source Separation with Band-split RNN | Papers | HyperAI