Action Recognition in Videos on NTU RGB+D

Evaluation Metrics

Accuracy (CS)
Accuracy (CV)
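
The page does not define the two metrics. The sketch below assumes that CS and CV stand for the cross-subject and cross-view evaluation splits customary for NTU RGB+D, and that accuracy means top-1 classification accuracy in percent. The helper names `accuracy` and `split_by_group`, and all subject ids, camera ids, labels, and predictions, are hypothetical toy values chosen for illustration. They are not the benchmark's official split lists.

```python
from typing import Iterable, List, Sequence, Tuple


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Top-1 accuracy in percent: share of samples whose prediction equals the label."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


def split_by_group(
    group_ids: Sequence[int], train_groups: Iterable[int]
) -> Tuple[List[int], List[int]]:
    """Return (train_indices, test_indices); a sample is in train if its group id is in train_groups."""
    train_set = set(train_groups)
    train_idx = [i for i, g in enumerate(group_ids) if g in train_set]
    test_idx = [i for i, g in enumerate(group_ids) if g not in train_set]
    return train_idx, test_idx


# Hypothetical toy data: six clips with made-up subject ids, camera ids, labels, predictions.
subjects = [1, 1, 2, 2, 3, 3]
cameras = [1, 2, 1, 2, 1, 2]
labels = [0, 1, 0, 1, 0, 1]
predictions = [0, 1, 0, 0, 1, 1]

# Cross-subject style: hold out every clip of the subjects not used for training.
_, cs_test = split_by_group(subjects, train_groups={1, 2})
cs_acc = accuracy([predictions[i] for i in cs_test], [labels[i] for i in cs_test])

# Cross-view style: hold out every clip of the cameras not used for training.
_, cv_test = split_by_group(cameras, train_groups={2})
cv_acc = accuracy([predictions[i] for i in cv_test], [labels[i] for i in cv_test])

print(f"Accuracy (CS): {cs_acc:.2f}  Accuracy (CV): {cv_acc:.2f}")
```

Both numbers use the same accuracy function; only the rule that decides which clips are held out for testing differs.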

Results

Performance of each model on this benchmark

| Model | Accuracy (CS) | Accuracy (CV) | Paper Title | Repository |
| --- | --- | --- | --- | --- |
| DSCNet (RGB + Pose) | 97.4 | 99.4 | A Dense-Sparse Complementary Network for Human Action Recognition based on RGB and Skeleton Modalities | - |
| PoseC3D (RGB + Pose) | 97.0 | 99.6 | Revisiting Skeleton-based Action Recognition | |
| π-ViT (RGB + Pose) | 96.3 | 99.0 | Just Add $\pi$! Pose Induced Video Transformers for Understanding Activities of Daily Living | |
| UMDR (RGB-D) | 96.2 | 98.0 | A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition | |
| MMNet (RGB + Pose) | 96.0 | 98.8 | MMNet: A Model-Based Multimodal Network for Human Action Recognition in RGB-D Videos | - |
| Hierarchical Action Classification (RGB + Pose) | 95.66 | 98.79 | Hierarchical Action Classification with Network Pruning | - |
| VPN (RGB + Pose) | 95.5 | 98.0 | VPN: Learning Video-Pose Embedding for Activities of Daily Living | |
| EPP-Net (Parsing + Pose) | 94.7 | 97.7 | Explore Human Parsing Modality for Action Recognition | |
| 3DA (RGB + Pose) | 94.3 | 97.9 | Cross-Modal Learning with 3D Deformable Attention for Action Recognition | - |
| Action Machine (RGB only) | 94.3 | 97.2 | Action Machine: Rethinking Action Recognition in Trimmed Videos | - |
| π-ViT (RGB only) | 94.0 | 97.9 | Just Add $\pi$! Pose Induced Video Transformers for Understanding Activities of Daily Living | |
| IPP-Net (Parsing + Pose) | 93.8 | 97.1 | Integrating Human Parsing and Pose Network for Human Action Recognition | |
| ViewCon (RGB + Pose) | 93.7 | 98.9 | Multi-View Action Recognition Using Contrastive Learning | - |
| DVANet (RGB only) | 93.4 | 98.1 | DVANet: Disentangling View and Action Features for Multi-View Action Recognition | |
| TSMF (RGB + Pose) | 92.5 | 97.4 | Multimodal Fusion via Teacher-Student Network for Indoor Action Recognition | - |
| MSAF (RGB+Pose) | 92.24 | - | MSAF: Multimodal Split Attention Fusion | |
| STAR-Transformer (RGB + Pose) | 92.0 | 96.5 | STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition | - |
| MMTM (RGB+Pose) | 91.99 | - | MMTM: Multimodal Transfer Module for CNN Fusion | |
| FUSION (IR+Pose) | 91.8 | 94.9 | Infrared and 3D skeleton feature fusion for RGB-D action recognition | |
| PoseMap (RGB+Pose) | 91.7 | 95.2 | Recognizing Human Actions as the Evolution of Pose Estimation Maps | - |