Command Palette
Search for a command to run...
Duan Haodong ; Zhao Yue ; Chen Kai ; Lin Dahua ; Dai Bo

Abstract
Human skeleton, as a compact representation of human action, has receivedincreasing attention in recent years. Many skeleton-based action recognitionmethods adopt graph convolutional networks (GCN) to extract features on top ofhuman skeletons. Despite the positive results shown in previous works,GCN-based methods are subject to limitations in robustness, interoperability,and scalability. In this work, we propose PoseC3D, a new approach toskeleton-based action recognition, which relies on a 3D heatmap stack insteadof a graph sequence as the base representation of human skeletons. Compared toGCN-based methods, PoseC3D is more effective in learning spatiotemporalfeatures, more robust against pose estimation noises, and generalizes better incross-dataset settings. Also, PoseC3D can handle multiple-person scenarioswithout additional computation cost, and its features can be easily integratedwith other modalities at early fusion stages, which provides a great designspace to further boost the performance. On four challenging datasets, PoseC3Dconsistently obtains superior performance, when used alone on skeletons and incombination with the RGB modality.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-action-recognition-on-assembly101 | RGBPoseConv3D | Actions Top-1: 33.61 Object Top-1: 42.90 Verbs Top-1: 61.99 |
| action-recognition-in-videos-on-ntu-rgbd | PoseC3D (RGB + Pose) | Accuracy (CS): 97.0 Accuracy (CV): 99.6 |
| action-recognition-in-videos-on-ntu-rgbd-120 | PoseC3D (RGB + Pose) | Accuracy (Cross-Setup): 96.4 Accuracy (Cross-Subject): 95.3 |
| action-recognition-in-videos-on-volleyball | PoseC3D (Pose Only) | Accuracy: 91.3 |
| action-recognition-on-h2o-2-hands-and-objects | RGBPoseConv3D | Actions Top-1: 83.47 Hand Pose: 2D Object Label: No Object Pose: No RGB: Yes |
| group-activity-recognition-on-volleyball | PoseC3D (Pose-Only) | Accuracy: 91.3 |
| skeleton-based-action-recognition-on-kinetics | PoseC3D | Accuracy: 47.7 |
| skeleton-based-action-recognition-on-kinetics | PoseC3D (SlowOnly-346) | Accuracy: 49.1 |
| skeleton-based-action-recognition-on-ntu-rgbd | PoseC3D [3D Heatmap] | Accuracy (CS): 94.1 Accuracy (CV): 97.1 Ensembled Modalities: 2 |
| skeleton-based-action-recognition-on-ntu-rgbd-1 | PoseC3D (w. HRNet 2D Skeleton) | Accuracy (Cross-Setup): 90.3 Accuracy (Cross-Subject): 86.9 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.