HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Multi-label Music Genre Classification from Audio, Text, and Images Using Deep Features

Sergio Oramas; Oriol Nieto; Francesco Barbieri; Xavier Serra

Multi-label Music Genre Classification from Audio, Text, and Images Using Deep Features

Abstract

Music genres allow to categorize musical items that share common characteristics. Although these categories are not mutually exclusive, most related research is traditionally focused on classifying tracks into a single class. Furthermore, these categories (e.g., Pop, Rock) tend to be too broad for certain applications. In this work we aim to expand this task by categorizing musical items into multiple and fine-grained labels, using three different data modalities: audio, text, and images. To this end we present MuMu, a new dataset of more than 31k albums classified into 250 genre classes. For every album we have collected the cover image, text reviews, and audio tracks. Additionally, we propose an approach for multi-label genre classification based on the combination of feature embeddings learned with state-of-the-art deep learning methodologies. Experiments show major differences between modalities, which not only introduce new baselines for multi-label genre classification, but also suggest that combining them yields improved results.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
genre-classification-on-fmacnn
CNN: 855

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Multi-label Music Genre Classification from Audio, Text, and Images Using Deep Features | Papers | HyperAI