8 months ago

Abstract

Respiratory sound classification (RSC) is challenging due to varied acousticsignatures, primarily influenced by patient demographics and recordingenvironments. To address this issue, we introduce a text-audio multimodal modelthat utilizes metadata of respiratory sounds, which provides usefulcomplementary information for RSC. Specifically, we fine-tune a pretrainedtext-audio multimodal model using free-text descriptions derived from the soundsamples' metadata which includes the gender and age of patients, type ofrecording devices, and recording location on the patient's body. Our methodachieves state-of-the-art performance on the ICBHI dataset, surpassing theprevious best result by a notable margin of 1.17%. This result validates theeffectiveness of leveraging metadata and respiratory sound samples in enhancingRSC performance. Additionally, we investigate the model performance in the casewhere metadata is partially unavailable, which may occur in real-world clinicalsetting.

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

8 months ago

Multimodal

Multimodal Representation

June-Woo Kim*1,2, Miika Toikkanen2, Yera Choi3, Seoung-Eun Moon3†, Ho-Young Jung1†

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

8 months ago

Multimodal

Multimodal Representation

June-Woo Kim*1,2, Miika Toikkanen2, Yera Choi3, Seoung-Eun Moon3†, Ho-Young Jung1†

Abstract

Source PDF

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification

June-Woo Kim*1,2, Miika Toikkanen2, Yera Choi3, Seoung-Eun Moon3†, Ho-Young Jung1†

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification

June-Woo Kim*1,2, Miika Toikkanen2, Yera Choi3, Seoung-Eun Moon3†, Ho-Young Jung1†

Abstract

Build AI with AI

HyperAI Newsletters

Command Palette

BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification

June-Woo Kim*1,2, Miika Toikkanen2, Yera Choi3, Seoung-Eun Moon3†, Ho-Young Jung1†

Abstract

Build AI with AI

HyperAI Newsletters