HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Revisiting Skeleton-based Action Recognition

Duan Haodong ; Zhao Yue ; Chen Kai ; Lin Dahua ; Dai Bo

Revisiting Skeleton-based Action Recognition

Abstract

Human skeleton, as a compact representation of human action, has receivedincreasing attention in recent years. Many skeleton-based action recognitionmethods adopt graph convolutional networks (GCN) to extract features on top ofhuman skeletons. Despite the positive results shown in previous works,GCN-based methods are subject to limitations in robustness, interoperability,and scalability. In this work, we propose PoseC3D, a new approach toskeleton-based action recognition, which relies on a 3D heatmap stack insteadof a graph sequence as the base representation of human skeletons. Compared toGCN-based methods, PoseC3D is more effective in learning spatiotemporalfeatures, more robust against pose estimation noises, and generalizes better incross-dataset settings. Also, PoseC3D can handle multiple-person scenarioswithout additional computation cost, and its features can be easily integratedwith other modalities at early fusion stages, which provides a great designspace to further boost the performance. On four challenging datasets, PoseC3Dconsistently obtains superior performance, when used alone on skeletons and incombination with the RGB modality.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
3d-action-recognition-on-assembly101RGBPoseConv3D
Actions Top-1: 33.61
Object Top-1: 42.90
Verbs Top-1: 61.99
action-recognition-in-videos-on-ntu-rgbdPoseC3D (RGB + Pose)
Accuracy (CS): 97.0
Accuracy (CV): 99.6
action-recognition-in-videos-on-ntu-rgbd-120PoseC3D (RGB + Pose)
Accuracy (Cross-Setup): 96.4
Accuracy (Cross-Subject): 95.3
action-recognition-in-videos-on-volleyballPoseC3D (Pose Only)
Accuracy: 91.3
action-recognition-on-h2o-2-hands-and-objectsRGBPoseConv3D
Actions Top-1: 83.47
Hand Pose: 2D
Object Label: No
Object Pose: No
RGB: Yes
group-activity-recognition-on-volleyballPoseC3D (Pose-Only)
Accuracy: 91.3
skeleton-based-action-recognition-on-kineticsPoseC3D
Accuracy: 47.7
skeleton-based-action-recognition-on-kineticsPoseC3D (SlowOnly-346)
Accuracy: 49.1
skeleton-based-action-recognition-on-ntu-rgbdPoseC3D [3D Heatmap]
Accuracy (CS): 94.1
Accuracy (CV): 97.1
Ensembled Modalities: 2
skeleton-based-action-recognition-on-ntu-rgbd-1PoseC3D (w. HRNet 2D Skeleton)
Accuracy (Cross-Setup): 90.3
Accuracy (Cross-Subject): 86.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Revisiting Skeleton-based Action Recognition | Papers | HyperAI