HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Jun Chen Zilin Wang Deyi Tuo Zhiyong Wu Shiyin Kang Helen Meng

FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement

Abstract

Previously proposed FullSubNet has achieved outstanding performance in Deep Noise Suppression (DNS) Challenge and attracted much attention. However, it still encounters issues such as input-output mismatch and coarse processing for frequency bands. In this paper, we propose an extended single-channel real-time speech enhancement framework called FullSubNet+ with following significant improvements. First, we design a lightweight multi-scale time sensitive channel attention (MulCA) module which adopts multi-scale convolution and channel attention mechanism to help the network focus on more discriminative frequency bands for noise reduction. Then, to make full use of the phase information in noisy speech, our model takes all the magnitude, real and imaginary spectrograms as inputs. Moreover, by replacing the long short-term memory (LSTM) layers in original full-band model with stacked temporal convolutional network (TCN) blocks, we design a more efficient full-band module called full-band extractor. The experimental results in DNS Challenge dataset show the superior performance of our FullSubNet+, which reaches the state-of-the-art (SOTA) performance and outperforms other existing speech enhancement approaches.

Code Repositories

hit-thusz-rookiecj/fullsubnet-plus
pytorch
Mentioned in GitHub
thuhcsi/fullsubnet-plus
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speech-enhancement-on-deep-noise-suppressionFullSubNet+
PESQ-NB: 3.666
PESQ-WB: 3.218
SI-SDR-WB: 16.81

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement | Papers | HyperAI