HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

TransNet V2: An effective deep network architecture for fast shot transition detection

Souček Tomáš ; Lokoč Jakub

TransNet V2: An effective deep network architecture for fast shot
  transition detection

Abstract

Although automatic shot transition detection approaches are alreadyinvestigated for more than two decades, an effective universal human-levelmodel was not proposed yet. Even for common shot transitions like hard cuts orsimple gradual changes, the potential diversity of analyzed video contents maystill lead to both false hits and false dismissals. Recently, deeplearning-based approaches significantly improved the accuracy of shottransition detection using 3D convolutional architectures and artificiallycreated training data. Nevertheless, one hundred percent accuracy is still anunreachable ideal. In this paper, we share the current version of our deepnetwork TransNet V2 that reaches state-of-the-art performance on respectedbenchmarks. A trained instance of the model is provided so it can be instantlyutilized by the community for a highly efficient analysis of large videoarchives. Furthermore, the network architecture, as well as our experience withthe training process, are detailed, including simple code snippets forconvenient usage of the proposed model and visualization of results.

Code Repositories

soCzech/MasterThesis
Mentioned in GitHub
soCzech/TransNetV2
Official
tf
Mentioned in GitHub
wqliu657/TransNetV2
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
camera-shot-boundary-detection-on-clipshotsTransNet V2
F1 score: 77.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp