HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Rong Chao Cheng Yu Szu-Wei Fu Xugang Lu Yu Tsao

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Abstract

Speech enhancement (SE) performance has improved considerably owing to the use of deep learning models as a base function. Herein, we propose a perceptual contrast stretching (PCS) approach to further improve SE performance. The PCS is derived based on the critical band importance function and is applied to modify the targets of the SE model. Specifically, the contrast of target features is stretched based on perceptual importance, thereby improving the overall SE performance. Compared with post-processing-based implementations, incorporating PCS into the training phase preserves performance and reduces online computation. Notably, PCS can be combined with different SE model architectures and training criteria. Furthermore, PCS does not affect the causality or convergence of SE model training. Experimental results on the VoiceBank-DEMAND dataset show that the proposed method can achieve state-of-the-art performance on both causal (PESQ score = 3.07) and noncausal (PESQ score = 3.35) SE tasks.

Code Repositories

roychao19477/pcs
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
speech-enhancement-on-demandPCS
COVL: 3.92
CSIG: 4.43
PESQ (wb): 3.35
STOI: 95

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Perceptual Contrast Stretching on Target Feature for Speech Enhancement | Papers | HyperAI