Relational Reasoning Over Spatial-Temporal Graphs for Video Summarization

Jie Zhou, Jiwen Lu, Yucheng Han, Wencheng Zhu

Abstract

In this paper, we propose a dynamic graph modeling approach to learn spatial-temporal representations for video summarization. Most existing video summarization methods extract image-level features with ImageNet pre-trained deep models. In contrast, our method exploits object-level and relation-level information to capture spatial-temporal dependencies. Specifically, our method builds spatial graphs on the detected object proposals. We then construct a temporal graph from the aggregated representations of the spatial graphs. Afterward, we perform relational reasoning over the spatial and temporal graphs with graph convolutional networks and extract spatial-temporal representations for importance score prediction and key shot selection. To eliminate the relation clutter caused by densely connected nodes, we further design a self-attention edge pooling module, which disregards meaningless relations in the graphs. We conduct extensive experiments on two popular benchmarks, the SumMe and TVSum datasets. Experimental results demonstrate that the proposed method achieves superior performance compared with state-of-the-art video summarization methods.
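The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the cosine-similarity edge weights, the mean-pooling aggregation, the row-normalized propagation rule, and the use of raw edge weights as pooling scores are all assumptions made for illustration (the paper learns edge scores with self-attention), and every function name here is hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def cosine_adjacency(feats):
    """Dense affinity matrix from pairwise cosine similarity (assumed edge weighting).
    Negative similarities are clamped to zero so that row sums stay positive."""
    norms = [math.sqrt(sum(v * v for v in f)) or 1.0 for f in feats]
    n = len(feats)
    return [[max(0.0, sum(a * b for a, b in zip(feats[i], feats[j])) / (norms[i] * norms[j]))
             for j in range(n)] for i in range(n)]

def prune_edges(adj, keep_ratio):
    """Zero out all but the strongest off-diagonal edges.
    Stand-in for self-attention edge pooling: the edge weight itself serves as
    the score here, whereas the paper learns the scores."""
    n = len(adj)
    scores = sorted((adj[i][j] for i in range(n) for j in range(n) if i != j), reverse=True)
    if not scores:  # single-node graph: nothing to prune
        return [row[:] for row in adj]
    k = max(1, int(len(scores) * keep_ratio))
    threshold = scores[k - 1]
    return [[adj[i][j] if i == j or adj[i][j] >= threshold else 0.0
             for j in range(n)] for i in range(n)]

def gcn_layer(adj, feats, weight):
    """One propagation step: ReLU(D^-1 A_hat X W), where A_hat is adj with unit self-loops."""
    n = len(adj)
    a_hat = [[1.0 if i == j else adj[i][j] for j in range(n)] for i in range(n)]
    a_norm = [[v / sum(row) for v in row] for row in a_hat]
    out = matmul(matmul(a_norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in out]

def mean_pool(nodes):
    """Aggregate node features into one vector by averaging (assumed aggregation)."""
    return [sum(col) / len(nodes) for col in zip(*nodes)]

def spatial_temporal_features(video, w_spatial, w_temporal, keep_ratio=0.5):
    """video: list of frames, each a list of object-proposal feature vectors."""
    frame_nodes = []
    for objects in video:
        # Spatial graph over the objects detected in one frame.
        adj = prune_edges(cosine_adjacency(objects), keep_ratio)
        frame_nodes.append(mean_pool(gcn_layer(adj, objects, w_spatial)))
    # Temporal graph whose nodes are the aggregated spatial-graph representations.
    temporal_adj = prune_edges(cosine_adjacency(frame_nodes), keep_ratio)
    return gcn_layer(temporal_adj, frame_nodes, w_temporal)

# Toy input: two frames with three and two object proposals, 2-D features.
video = [
    [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]],
    [[0.5, 0.5], [0.4, 0.6]],
]
identity = [[1.0, 0.0], [0.0, 1.0]]
reps = spatial_temporal_features(video, identity, identity)
```

In a full model, the per-frame vectors in `reps` would feed a regressor that predicts importance scores, and the weight matrices would be learned rather than fixed to the identity.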

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| graph-classification-on-nci1 | SAEPool_g | Accuracy: 74.48% |
| graph-classification-on-nci109 | SAEPool_h | Accuracy: 75.85% |
| graph-classification-on-proteins | SAEPool | Accuracy: 80.36% |
| supervised-video-summarization-on-summe | RR-STG | F1-score (Augmented): 54.8; F1-score (Canonical): 53.4; Kendall's Tau: 0.211; Spearman's Rho: 0.234 |
| supervised-video-summarization-on-tvsum | RR-STG | F1-score (Augmented): 63.6; F1-score (Canonical): 63.0; Kendall's Tau: 0.162; Spearman's Rho: 0.212 |
| video-summarization-on-summe | RR-STG | F1-score (Augmented): 55.3; F1-score (Canonical): 54.5 |
| video-summarization-on-tvsum | RR-STG | F1-score (Augmented): 63.6; F1-score (Canonical): 63.0 |
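
For readers unfamiliar with the metrics listed above, the sketch below shows one common way to compute an overlap-based F1 score between a predicted and a reference summary, and Kendall's tau between predicted and reference importance scores. The exact evaluation protocol behind the reported numbers is not stated on this page, so the tau-a variant and the per-frame binary encoding are assumptions, and the function names are hypothetical.

```python
def f1_overlap(pred, truth):
    """F1 between two binary per-frame selections of equal length."""
    overlap = sum(p and t for p, t in zip(pred, truth))
    if overlap == 0:
        return 0.0
    precision = overlap / sum(pred)   # share of selected frames that are correct
    recall = overlap / sum(truth)     # share of reference frames that were selected
    return 2 * precision * recall / (precision + recall)

def kendall_tau(x, y):
    """Kendall's tau-a: (concordant - discordant) / number of pairs.
    Tied pairs count as neither concordant nor discordant."""
    n = len(x)
    total = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            total += (prod > 0) - (prod < 0)
    return total / (n * (n - 1) / 2)

# Toy example: four frames, one of the two selected frames matches the reference.
print(f1_overlap([1, 1, 0, 0], [1, 0, 1, 0]))  # 0.5
print(kendall_tau([1, 2, 3], [3, 2, 1]))       # -1.0
```

F1 scores in the table appear to be reported on a 0 to 100 scale, so a value of 0.5 from this sketch would correspond to 50.0.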
