HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

Kaiyang Zhou; Yu Qiao; Tao Xiang

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

Abstract

Video summarization aims to facilitate large-scale video browsing by producing short, concise summaries that are diverse and representative of original videos. In this paper, we formulate video summarization as a sequential decision-making process and develop a deep summarization network (DSN) to summarize videos. DSN predicts for each video frame a probability, which indicates how likely a frame is selected, and then takes actions based on the probability distributions to select frames, forming video summaries. To train our DSN, we propose an end-to-end, reinforcement learning-based framework, where we design a novel reward function that jointly accounts for diversity and representativeness of generated summaries and does not rely on labels or user interactions at all. During training, the reward function judges how diverse and representative the generated summaries are, while DSN strives for earning higher rewards by learning to produce more diverse and more representative summaries. Since labels are not required, our method can be fully unsupervised. Extensive experiments on two benchmark datasets show that our unsupervised method not only outperforms other state-of-the-art unsupervised methods, but also is comparable to or even superior than most of published supervised approaches.

Code Repositories

KaiyangZhou/vsumm-reinforce
Official
pytorch
Mentioned in GitHub
ymlwww/rlproject
pytorch
Mentioned in GitHub
KaiyangZhou/pytorch-vsumm-reinforce
pytorch
Mentioned in GitHub
neda60/video-summarization-using-FCSN
pytorch
Mentioned in GitHub
n9839950/EGH400-2-DSN
pytorch
Mentioned in GitHub
HoganZhang/pytorch-vsumm-reinforce
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
supervised-video-summarization-on-summeDR-DSN
F1-score (Augmented): 43.9
F1-score (Canonical): 42.1
supervised-video-summarization-on-tvsumDR-DSN
F1-score (Augmented): 59.8
F1-score (Canonical): 58.1
unsupervised-video-summarization-on-summeDR-DSN
F1-score: 41.4
Parameters (M): 2.63
training time (s): 19.8
unsupervised-video-summarization-on-tvsumDR-DSN
F1-score: 57.6
Kendall's Tau: 0.020
Parameters (M): 2.63
Spearman's Rho: 0.026
training time (s): 58.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Papers | HyperAI