HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

D²Net: A Denoising and Dereverberation Network Based on Two-branch Encoder and Dual-path Transformer

{and Ying Hu Yadong Chen Wenbing Wei Liusong Wang}

D²Net: A Denoising and Dereverberation Network Based on Two-branch Encoder and Dual-path Transformer

Abstract

The simultaneous denoising and dereverberation for single-channel mixture speech under the complicated acoustic environment is considered to be a challengeable task. In this paper, we propose a denoising and dereverberation network named as D²Net in which a two-branch encoder (TBE) is designed to extract and selectively fuse features with different granularity. In addition, we design a global-local dual-path transformer (GLDPT) which introduces the local dense synthesizer attention (LDSA) in the dual-path transformer to improve the perception of local information. We evaluated our proposed D²Net and conducted ablation studies on the VoiceBank+DEMAND and WHAMR! datasets. Meanwhile, we chose three types of data in the WHAMR! dataset to verify the ability of the D²Net on the tasks of denoising-only, dereverberation-only, and simultaneous denoising and dereverberation, respectively. Experimental results show that our proposed model outperforms the comparative models, and all achieve better performance on the tasks of simultaneous denoising and dereverberation, dereverberation-only, and denoising-only, while keeping a small number of network parameters.

Benchmarks

BenchmarkMethodologyMetrics
speech-enhancement-on-demandD²Net
CBAK: 3.18
COVL: 3.92
CSIG: 4.63
PESQ (wb): 3.27
STOI: 96

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
D²Net: A Denoising and Dereverberation Network Based on Two-branch Encoder and Dual-path Transformer | Papers | HyperAI