STAR-Net: Action Recognition using Spatio-Temporal Activation Reprojection

McNally William ; Wong Alexander ; McPhee John

Abstract

While depth cameras and inertial sensors have been frequently leveraged for human action recognition, these sensing modalities are impractical in many scenarios where cost or environmental constraints prohibit their use. As such, there has been recent interest in human action recognition using low-cost, readily-available RGB cameras via deep convolutional neural networks. However, many of the deep convolutional neural networks proposed for action recognition thus far have relied heavily on learning global appearance cues directly from imaging data, resulting in highly complex network architectures that are computationally expensive and difficult to train. Motivated to reduce network complexity and achieve higher performance, we introduce the concept of spatio-temporal activation reprojection (STAR). More specifically, we reproject the spatio-temporal activations generated by human pose estimation layers in space and time using a stack of 3D convolutions. Experimental results on UTD-MHAD and J-HMDB demonstrate that an end-to-end architecture based on the proposed STAR framework (which we nickname STAR-Net) is proficient in single-environment and small-scale applications. On UTD-MHAD, STAR-Net outperforms several methods using richer data modalities such as depth and inertial sensors.
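
To make the phrase "a stack of 3D convolutions" over pose activations more concrete, the following is a minimal pure-Python sketch of a single 3D convolution applied to a time-ordered stack of heatmaps. It is not the STAR-Net architecture: the helper names (`conv3d_valid`, `make_heatmap_stack`), the toy single-joint heatmaps, the tensor sizes, and the 2x2x2 averaging kernel are all assumptions chosen for illustration, whereas a real model would use learned kernels, multiple channels, and several stacked layers.

```python
# Illustrative sketch only: a naive "valid" 3D convolution (cross-correlation,
# as deep-learning libraries define it) over a stack of per-frame heatmaps.
# Shapes, names, and the averaging kernel are assumptions, not the STAR-Net design.


def conv3d_valid(volume, kernel):
    """Slide a 3D kernel over a (T, H, W) volume with stride 1 and no padding."""
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                acc = 0.0
                # Sum over the local window spanning time and space.
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += volume[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


def make_heatmap_stack(num_frames, size):
    """Hypothetical single-joint activations: one hot pixel moving diagonally."""
    stack = [[[0.0] * size for _ in range(size)] for _ in range(num_frames)]
    for t in range(num_frames):
        stack[t][t % size][t % size] = 1.0
    return stack


heatmaps = make_heatmap_stack(num_frames=4, size=4)  # shape (T=4, H=4, W=4)
kernel = [[[1.0 / 8] * 2 for _ in range(2)] for _ in range(2)]  # 2x2x2 mean filter
features = conv3d_valid(heatmaps, kernel)  # shape (3, 3, 3)
```

Because the kernel spans two consecutive frames, each output value mixes activations from neighbouring positions and neighbouring time steps, which is the basic property that lets 3D convolutions capture motion of a joint rather than only its per-frame location.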

Benchmarks

Benchmark | Methodology | Metrics
multimodal-activity-recognition-on-utd-mhad | STAR-Net | Accuracy (CS): 90
skeleton-based-action-recognition-on-j-hmdb | STAR-Net | Accuracy (RGB+pose): 64.3
