Multimodal Transformer for Unaligned Multimodal Language Sequences

Yao-Hung Hubert Tsai; Shaojie Bai; Paul Pu Liang; J. Zico Kolter; Louis-Philippe Morency; Ruslan Salakhutdinov

Abstract

Human language is often multimodal, comprising a mixture of natural language, facial gestures, and acoustic behaviors. However, modeling such multimodal human language time-series data poses two major challenges: 1) inherent data non-alignment due to variable sampling rates for the sequences from each modality; and 2) long-range dependencies between elements across modalities. In this paper, we introduce the Multimodal Transformer (MulT) to generically address the above issues in an end-to-end manner without explicitly aligning the data. At the heart of our model is the directional pairwise crossmodal attention, which attends to interactions between multimodal sequences across distinct time steps and latently adapts streams from one modality to another. Comprehensive experiments on both aligned and non-aligned multimodal time-series show that our model outperforms state-of-the-art methods by a large margin. In addition, empirical analysis suggests that the proposed crossmodal attention mechanism in MulT can capture correlated crossmodal signals.
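
The directional crossmodal attention described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain scaled dot-product attention in which queries come from the target modality and keys and values come from the source modality, and it omits the learned projections, multiple heads, and positional information a full model would use. The function name `crossmodal_attention` and the toy `language` and `audio` sequences are hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def crossmodal_attention(target, source):
    """Adapt `source` to the time axis of `target` (direction: source -> target).

    target: list of T_t feature vectors (used as queries), each of length d
    source: list of T_s feature vectors (used as keys and values), each of length d
    Returns (output, weights): output has T_t rows, weights is T_t x T_s.
    Assumption: Q/K/V projections are the identity to keep the sketch short.
    """
    d = len(target[0])
    scale = math.sqrt(d)
    output, weights = [], []
    for q in target:
        # Score this target step against every source step.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in source]
        attn = softmax(scores)
        weights.append(attn)
        # Weighted sum of source vectors, one output row per target step.
        output.append([sum(a * v[j] for a, v in zip(attn, source)) for j in range(d)])
    return output, weights


# Two unaligned streams: 3 "language" steps and 5 "audio" steps, both with d = 2.
language = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
audio = [[0.5, 0.1], [0.9, 0.0], [0.0, 0.8], [0.2, 0.2], [0.7, 0.7]]

adapted, attn = crossmodal_attention(language, audio)
print(len(adapted), len(attn[0]))  # 3 5
```

The two sequences never need to share a length or sampling rate: each target step attends over all source steps, so the output always follows the target's time axis. Swapping the arguments gives the opposite direction, which is why the attention is described as directional and pairwise.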

Code Repositories

JhnLee/multimodal-transformer (PyTorch)
yaohungt/Multimodal-Transformer (official, PyTorch)
kenford953/graphcage (PyTorch)
pliang279/MFN (PyTorch)

Benchmarks

Benchmark: multimodal-sentiment-analysis-on-mosi
Methodology: MulT
Metrics: Accuracy: 83; F1 score: 82.8
