HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Daisuke Niizumi; Daiki Takeuchi; Yasunori Ohishi; Noboru Harada; Kunio Kashino

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Abstract

To reduce the need for skilled clinicians in heart sound interpretation, recent studies on automating cardiac auscultation have explored deep learning approaches. However, despite the demands for large data for deep learning, the size of the heart sound datasets is limited, and no pre-trained model is available. On the contrary, many pre-trained models for general audio tasks are available as general-purpose audio representations. This study explores the potential of general-purpose audio representations pre-trained on large-scale datasets for transfer learning in heart murmur detection. Experiments on the CirCor DigiScope heart sound dataset show that the recent self-supervised learning Masked Modeling Duo (M2D) outperforms previous methods with the results of a weighted accuracy of 0.832 and an unweighted average recall of 0.713. Experiments further confirm improved performance by ensembling M2D with other models. These results demonstrate the effectiveness of general-purpose audio representation in processing heart sounds and open the way for further applications. Our code is available online which runs on a 24 GB consumer GPU at https://github.com/nttcslab/m2d/tree/master/app/circor

Code Repositories

nttcslab/m2d
Official
pytorch
Mentioned in GitHub
nttcslab/eval-audio-repr
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
classify-murmurs-on-circor-digiscopeM2D
Unweighted average recall: 0.713
Weighted Accuracy: 0.832

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection | Papers | HyperAI