HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

VRAG: Region Attention Graphs for Content-Based Video Retrieval

Kennard Ng Ser-Nam Lim Gim Hee Lee

VRAG: Region Attention Graphs for Content-Based Video Retrieval

Abstract

Content-based Video Retrieval (CBVR) is used on media-sharing platforms for applications such as video recommendation and filtering. To manage databases that scale to billions of videos, video-level approaches that use fixed-size embeddings are preferred due to their efficiency. In this paper, we introduce Video Region Attention Graph Networks (VRAG) that improves the state-of-the-art of video-level methods. We represent videos at a finer granularity via region-level features and encode video spatio-temporal dynamics through region-level relations. Our VRAG captures the relationships between regions based on their semantic content via self-attention and the permutation invariant aggregation of Graph Convolution. In addition, we show that the performance gap between video-level and frame-level methods can be reduced by segmenting videos into shots and using shot embeddings for video retrieval. We evaluate our VRAG over several video retrieval tasks and achieve a new state-of-the-art for video-level retrieval. Furthermore, our shot-level VRAG shows higher retrieval precision than other existing video-level methods, and closer performance to frame-level methods at faster evaluation speeds. Finally, our code will be made publicly available.

Benchmarks

BenchmarkMethodologyMetrics
video-retrieval-on-fivr-200kVRAG (CS)
mAP (CSVR): 0.678
mAP (DSVR): 0.723
mAP (ISVR): 0.554
video-retrieval-on-fivr-200kVRAG (video)
mAP (CSVR): 0.470
mAP (DSVR): 0.484
mAP (ISVR): 0.399

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
VRAG: Region Attention Graphs for Content-Based Video Retrieval | Papers | HyperAI