CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram

Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro

Abstract

In this work, we present CleanUNet 2, a speech denoising model that combines the advantages of a waveform denoiser and a spectrogram denoiser. CleanUNet 2 uses a two-stage framework inspired by popular speech synthesis methods that consist of a waveform model and a spectrogram model. Specifically, CleanUNet 2 builds upon CleanUNet, the state-of-the-art waveform denoiser, and further boosts its performance by taking predicted spectrograms from a spectrogram denoiser as input. We demonstrate that CleanUNet 2 outperforms previous methods on a range of objective and subjective evaluations.
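
The two-stage data flow described in the abstract can be sketched roughly as follows. This is only an illustration of how a spectrogram stage might feed a waveform stage. Both stages here are toy signal-processing stand-ins, not the neural networks used in CleanUNet 2. All function names, the STFT settings (`N_FFT`, `HOP`), and the choice to pass the noisy waveform to the second stage alongside the predicted spectrogram are assumptions made for the example.

```python
import numpy as np

N_FFT, HOP = 512, 128  # assumed analysis settings, not taken from the paper


def magnitude_spectrogram(wave):
    """Hann-windowed STFT magnitude, shape (n_frames, N_FFT // 2 + 1)."""
    window = np.hanning(N_FFT)
    n_frames = 1 + (len(wave) - N_FFT) // HOP
    frames = np.stack([wave[i * HOP:i * HOP + N_FFT] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))


def spectrogram_denoiser(noisy_spec):
    """Stage 1 stand-in (toy, not a network): subtract a per-bin noise floor.

    The floor is the 10th percentile over time, which assumes the noise is
    roughly stationary and the speech is intermittent.
    """
    floor = np.percentile(noisy_spec, 10, axis=0, keepdims=True)
    return np.maximum(noisy_spec - floor, 0.0)


def waveform_denoiser(noisy_wave, predicted_spec):
    """Stage 2 stand-in (toy, not a network): apply a per-frame gain.

    The gain is the ratio of predicted to noisy spectral magnitude per frame,
    interpolated to one value per sample.
    """
    noisy_spec = magnitude_spectrogram(noisy_wave)
    gain = predicted_spec.sum(axis=1) / (noisy_spec.sum(axis=1) + 1e-8)
    centers = np.arange(len(gain)) * HOP + N_FFT // 2
    gain_per_sample = np.interp(np.arange(len(noisy_wave)), centers, gain)
    return noisy_wave * gain_per_sample


def denoise_two_stage(noisy_wave):
    """Run stage 1 on the spectrogram, then condition stage 2 on its output."""
    predicted_spec = spectrogram_denoiser(magnitude_spectrogram(noisy_wave))
    return predicted_spec, waveform_denoiser(noisy_wave, predicted_spec)
```

Calling `denoise_two_stage(noisy_wave)` on a 1-D NumPy array of at least `N_FFT` samples returns the intermediate predicted spectrogram and an output waveform of the same length as the input. In the actual model each stand-in would be replaced by a trained network.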

Benchmarks

Benchmark: speech-enhancement-on-deep-noise-suppression
Methodology: CleanUNet-2
Metrics: PESQ-NB 3.658, PESQ-WB 3.262
