HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Sangmin Bae June-Woo Kim Won-Yang Cho Hyerim Baek Soyoun Son Byungjo Lee Changwan Ha Kyongpil Tae Sungnyun Kim Se-Young Yun

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Abstract

Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.

Code Repositories

raymin0223/patch-mix_contrastive_learning
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
audio-classification-on-icbhi-respiratoryAST (Patch-Mix CL)
ICBHI Score: 62.37
Sensitivity: 43.07
Specificity: 81.66
audio-classification-on-icbhi-respiratoryAST (fine-tuning)
Sensitivity: 41.97
Specificity: 77.14
audio-classification-on-icbhi-respiratoryAST (fine-tuning)
ICBHI Score: 59.55

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification | Papers | HyperAI