HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

TräumerAI: Dreaming Music with StyleGAN

Dasaem Jeong; Seungheon Doh; Taegyun Kwon

TräumerAI: Dreaming Music with StyleGAN

Abstract

The goal of this paper to generate a visually appealing video that responds to music with a neural network so that each frame of the video reflects the musical characteristics of the corresponding audio clip. To achieve the goal, we propose a neural music visualizer directly mapping deep music embeddings to style embeddings of StyleGAN, named TräumerAI, which consists of a music auto-tagging model using short-chunk CNN and StyleGAN2 pre-trained on WikiArt dataset. Rather than establishing an objective metric between musical and visual semantics, we manually labeled the pairs in a subjective manner. An annotator listened to 100 music clips of 10 seconds long and selected an image that suits the music among the 200 StyleGAN-generated examples. Based on the collected data, we trained a simple transfer function that converts an audio embedding to a style embedding. The generated examples show that the mapping between audio and video makes a certain level of intra-segment similarity and inter-segment dissimilarity.

Code Repositories

jdasam/traeumerAI
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
music-auto-tagging-on-timetravelFellini
0..5sec: 5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
TräumerAI: Dreaming Music with StyleGAN | Papers | HyperAI