HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Actor-Centric Relation Network

Chen Sun; Abhinav Shrivastava; Carl Vondrick; Kevin Murphy; Rahul Sukthankar; Cordelia Schmid

Actor-Centric Relation Network

Abstract

Current state-of-the-art approaches for spatio-temporal action localization rely on detections at the frame level and model temporal context with 3D ConvNets. Here, we go one step further and model spatio-temporal relations to capture the interactions between human actors, relevant objects and scene elements essential to differentiate similar human actions. Our approach is weakly supervised and mines the relevant elements automatically with an actor-centric relational network (ACRN). ACRN computes and accumulates pair-wise relation information from actor and global scene features, and generates relation features for action classification. It is implemented as neural networks and can be trained jointly with an existing action detection system. We show that ACRN outperforms alternative approaches which capture relation information, and that the proposed framework improves upon the state-of-the-art performance on JHMDB and AVA. A visualization of the learned relation features confirms that our approach is able to attend to the relevant relations for each action.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
action-recognition-in-videos-on-ava-v21ARCN
mAP (Val): 17.4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Actor-Centric Relation Network | Papers | HyperAI