Hierarchical Conditional Relation Networks for Video Question Answering

Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran

Abstract

Video question answering (VideoQA) is challenging as it requires modeling capacity to distill dynamic visual artifacts and distant relations and to associate them with linguistic concepts. We introduce a general-purpose reusable neural unit called Conditional Relation Network (CRN) that serves as a building block to construct more sophisticated structures for representation and reasoning over video. CRN takes as input an array of tensorial objects and a conditioning feature, and computes an array of encoded output objects. Model building becomes a simple exercise of replication, rearrangement and stacking of these reusable units for diverse modalities and contextual information. This design thus supports high-order relational and multi-step reasoning. The resulting architecture for VideoQA is a CRN hierarchy whose branches represent sub-videos or clips, all sharing the same question as the contextual condition. Our evaluations on well-known datasets achieved new SoTA results, demonstrating the impact of building a general-purpose reasoning unit on complex domains such as VideoQA.
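The abstract describes the CRN only at the interface level: an array of objects and a conditioning feature go in, an array of encoded objects comes out, and units are stacked into a hierarchy over clips. The toy sketch below is one possible reading of that interface, not the authors' implementation. Everything beyond the interface is an assumption made for illustration: the subset sampling over sizes 2..k_max, the mean pooling, the elementwise-product `condition` function (a stand-in for a learned sub-network), the parameters `k_max` and `t`, and the helper `two_level_hierarchy`, which pools each clip's outputs into a single vector for simplicity.

```python
import itertools
import random
from typing import List, Sequence

Vector = List[float]


def mean_pool(vectors: Sequence[Vector]) -> Vector:
    """Average a non-empty collection of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def condition(x: Vector, c: Vector) -> Vector:
    """Toy conditioning: elementwise product (stand-in for a learned network)."""
    return [xi * ci for xi, ci in zip(x, c)]


def crn(objects: Sequence[Vector], cond: Vector, k_max: int, t: int,
        rng: random.Random) -> List[Vector]:
    """Hypothetical CRN unit: one output object per subset size k in 2..k_max."""
    if not 2 <= k_max <= len(objects):
        raise ValueError("k_max must be between 2 and the number of objects")
    outputs = []
    for k in range(2, k_max + 1):
        subsets = list(itertools.combinations(objects, k))
        # Sample at most t subsets of size k (assumed sampling scheme).
        sampled = rng.sample(subsets, min(t, len(subsets)))
        # Aggregate each subset, then modulate it by the conditioning feature.
        conditioned = [condition(mean_pool(s), cond) for s in sampled]
        outputs.append(mean_pool(conditioned))
    return outputs


def two_level_hierarchy(clips: Sequence[Sequence[Vector]], question: Vector,
                        t: int, rng: random.Random) -> List[Vector]:
    """Clip-level CRNs feed a video-level CRN; all share the same question."""
    clip_vectors = []
    for frames in clips:
        clip_outputs = crn(frames, question, k_max=len(frames), t=t, rng=rng)
        # Simplification: collapse each clip's output array to one vector.
        clip_vectors.append(mean_pool(clip_outputs))
    return crn(clip_vectors, question, k_max=len(clip_vectors), t=t, rng=rng)


if __name__ == "__main__":
    rng = random.Random(0)
    frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    question = [1.0, 0.0]
    print(crn(frames, question, k_max=3, t=10, rng=rng))
    print(two_level_hierarchy([frames, frames], question, t=10, rng=rng))
```

In this sketch the same `question` vector conditions every unit at both levels, mirroring the shared contextual condition mentioned in the abstract. A real model would replace `mean_pool` and `condition` with learned layers and operate on tensors instead of Python lists.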

Code Repositories

thaolmk54/hcrn-videoqa (Official, PyTorch, mentioned on GitHub)

Benchmarks

Benchmark                                     Methodology   Metrics
video-question-answering-on-sutd-trafficqa    HCRN          1/2: 63.79; 1/4: 36.49
visual-question-answering-on-msrvtt-qa-1      HCRN          Accuracy: 0.356
visual-question-answering-on-msvd-qa-1        HCRN          Accuracy: 0.361
