HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

Junyan Wang; Yang Bai; Yang Long; Bingzhang Hu; Zhenhua Chai; Yu Guan; Xiaolin Wei

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

Abstract

Video summarization aims to select representative frames to retain high-level information, which is usually solved by predicting the segment-wise importance score via a softmax function. However, softmax function suffers in retaining high-rank representations for complex visual or sequential information, which is known as the Softmax Bottleneck problem. In this paper, we propose a novel framework named Dual Mixture Attention (DMASum) model with Meta Learning for video summarization that tackles the softmax bottleneck problem, where the Mixture of Attention layer (MoA) effectively increases the model capacity by employing twice self-query attention that can capture the second-order changes in addition to the initial query-key attention, and a novel Single Frame Meta Learning rule is then introduced to achieve more generalization to small datasets with limited training sources. Furthermore, the DMASum significantly exploits both visual and sequential attention that connects local key-frame and global attention in an accumulative way. We adopt the new evaluation protocol on two public datasets, SumMe, and TVSum. Both qualitative and quantitative experiments manifest significant improvements over the state-of-the-art methods.

Benchmarks

BenchmarkMethodologyMetrics
supervised-video-summarization-on-summeDMASum
F1-score (Canonical): 54.3
Kendall's Tau: 0.063
Spearman's Rho: 0.089
supervised-video-summarization-on-tvsumDMASum
F1-score (Canonical): 61.4
Kendall's Tau: 0.203
Spearman's Rho: 0.267

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization | Papers | HyperAI