Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks

Abstract

Shot boundary detection (SBD) is an important pre-processing step for video manipulation. Here, each segment of frames is classified as either a sharp transition, a gradual transition, or no transition. Current SBD techniques analyze hand-crafted features and attempt to optimize both detection accuracy and processing speed. However, the heavy computation of optical flow prevents this. To achieve this aim, we present an SBD technique based on spatio-temporal Convolutional Neural Networks (CNN). Since current datasets are not large enough to train an accurate SBD CNN, we present a new dataset containing more than 3.5 million frames of sharp and gradual transitions. The transitions are generated synthetically using image compositing models. Our dataset contains an additional 70,000 frames of important hard-negative no-transitions. We perform the largest evaluation to date for one SBD algorithm, on real and synthetic data, containing more than 4.85 million frames. In comparison to the state of the art, we perform better on dissolve (gradual) detection, achieve competitive performance on sharp detection, and produce a significant improvement on wipes. In addition, we are up to 11 times faster than the state of the art.
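
The abstract says that transitions are generated synthetically using image compositing models but does not give the formulas. As a rough illustration only, the sketch below assumes the common alpha-blending model, in which each output frame is (1 - alpha) * A + alpha * B: alpha steps from 0 to 1 for a sharp cut and ramps linearly for a dissolve. The function names (`alpha_schedule`, `composite`, `make_transition`), the linear ramp, and the use of nested lists as grayscale frames are all assumptions made for this example, not details taken from the paper or its code.

```python
# Illustrative sketch only: alpha-blend compositing of two clips.
# All names and the linear ramp are assumptions, not the paper's method.

def alpha_schedule(kind, length):
    """Return one blend weight in [0, 1] per frame of the transition."""
    if length < 2:
        raise ValueError("a transition needs at least 2 frames")
    if kind == "sharp":
        # Hard cut at the midpoint: clip A before it, clip B from it onward.
        return [0.0 if t < length // 2 else 1.0 for t in range(length)]
    if kind == "gradual":
        # Linear dissolve: the weight rises evenly from 0 to 1.
        return [t / (length - 1) for t in range(length)]
    raise ValueError(f"unknown transition kind: {kind!r}")


def composite(frame_a, frame_b, alpha):
    """Blend two equally sized grayscale frames (lists of pixel rows)."""
    return [
        [(1 - alpha) * a + alpha * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(frame_a, frame_b)
    ]


def make_transition(clip_a, clip_b, kind):
    """Composite two clips of equal length into one transition segment."""
    if len(clip_a) != len(clip_b):
        raise ValueError("clips must contain the same number of frames")
    alphas = alpha_schedule(kind, len(clip_a))
    return [composite(a, b, w) for a, b, w in zip(clip_a, clip_b, alphas)]


if __name__ == "__main__":
    # Two tiny 1x2 "videos": an all-black clip and an all-bright clip.
    dark = [[[0.0, 0.0]] for _ in range(5)]
    bright = [[[100.0, 100.0]] for _ in range(5)]
    for frame in make_transition(dark, bright, "gradual"):
        print(frame)
```

A wipe could be modelled in the same spirit by replacing the scalar weight with a per-pixel mask that sweeps across the frame over time; that variant is left out here to keep the sketch short.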

Code Repositories

soCzech/TransNetV2 (tf, mentioned in GitHub)
wqliu657/TransNetV2 (tf, mentioned in GitHub)
Tangshitao/ClipShots (mentioned in GitHub)
melgharib/DSBD (official, mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
camera-shot-boundary-detection-on-clipshots | DeepSBD | F1 score: 75.9
camera-shot-boundary-detection-on-msu-shot | PyScene | F score: 0.7349, FPS: 86
camera-shot-boundary-detection-on-msu-shot | PyScene-v2 | F score: 0.7534, FPS: 86
