HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Symphony Generation with Permutation Invariant Language Model

Jiafeng Liu Yuanliang Dong Zehua Cheng Xinran Zhang Xiaobing Li Feng Yu Maosong Sun

Symphony Generation with Permutation Invariant Language Model

Abstract

In this work, we propose a permutation invariant language model, SymphonyNet, as a solution for symbolic symphony music generation. We propose a novel Multi-track Multi-instrument Repeatable (MMR) representation for symphonic music and model the music sequence using a Transformer-based auto-regressive language model with specific 3-D positional embedding. To overcome length overflow when modeling extra-long symphony tokens, we also propose a modified Byte Pair Encoding algorithm (Music BPE) for music tokens and introduce a novel linear transformer decoder architecture as a backbone. Meanwhile, we train the decoder to learn automatic orchestration as a joint task by masking instrument information from the input. We also introduce a large-scale symbolic symphony dataset for the advance of symphony generation research. Empirical results show that the proposed approach can generate coherent, novel, complex and harmonious symphony as a pioneer solution for multi-track multi-instrument symbolic music generation.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
audio-generation-on-symphony-musicSymphonyNet
Human listening average results: 3.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Symphony Generation with Permutation Invariant Language Model | Papers | HyperAI