HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition

Konstantinos Papadopoulos Enjie Ghorbel Djamila Aouada Björn Ottersten

Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition

Abstract

This paper extends the Spatial-Temporal Graph Convolutional Network (ST-GCN) for skeleton-based action recognition by introducing two novel modules, namely, the Graph Vertex Feature Encoder (GVFE) and the Dilated Hierarchical Temporal Convolutional Network (DH-TCN). On the one hand, the GVFE module learns appropriate vertex features for action recognition by encoding raw skeleton data into a new feature space. On the other hand, the DH-TCN module is capable of capturing both short-term and long-term temporal dependencies using a hierarchical dilated convolutional network. Experiments have been conducted on the challenging NTU RGB-D-60 and NTU RGB-D 120 datasets. The obtained results show that our method competes with state-of-the-art approaches while using a smaller number of layers and parameters; thus reducing the required training time and memory.

Benchmarks

BenchmarkMethodologyMetrics
action-recognition-in-videos-on-ntu-rgbd-120ST-GCN + AS-GCN w/DH-TCN
Accuracy (Cross-Setup): 78.3
Accuracy (Cross-Subject): 79.2
skeleton-based-action-recognition-on-ntu-rgbdGVFE + AS-GCN with DH-TCN
Accuracy (CS): 85.3
Accuracy (CV): 92.8
skeleton-based-action-recognition-on-ntu-rgbd-1GVFE + AS-GCN with DH-TCN
Accuracy (Cross-Setup): 79.8%
Accuracy (Cross-Subject): 78.3%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Vertex Feature Encoding and Hierarchical Temporal Modeling in a Spatial-Temporal Graph Convolutional Network for Action Recognition | Papers | HyperAI