HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Speech Denoising in the Waveform Domain with Self-Attention

Zhifeng Kong Wei Ping Ambrish Dantrey Bryan Catanzaro

Speech Denoising in the Waveform Domain with Self-Attention

Abstract

In this work, we present CleanUNet, a causal speech denoising model on the raw waveform. The proposed model is based on an encoder-decoder architecture combined with several self-attention blocks to refine its bottleneck representations, which is crucial to obtain good results. The model is optimized through a set of losses defined over both waveform and multi-resolution spectrograms. The proposed method outperforms the state-of-the-art models in terms of denoised speech quality from various objective and subjective evaluation metrics. We release our code and models at https://github.com/nvidia/cleanunet.

Code Repositories

nvidia/cleanunet
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speech-enhancement-on-deep-noise-suppressionCleanUNet
PESQ-NB: 3.551
PESQ-WB: 3.146

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Speech Denoising in the Waveform Domain with Self-Attention | Papers | HyperAI