| PoseC3D (RGB + Pose) | 97.0 | 99.6 | Revisiting Skeleton-based Action Recognition | |
| Hierarchical Action Classification (RGB + Pose) | 95.66 | 98.79 | Hierarchical Action Classification with Network Pruning | - |
| EPP-Net (Parsing + Pose) | 94.7 | 97.7 | Explore Human Parsing Modality for Action Recognition | |
| Action Machine (RGB only) | 94.3 | 97.2 | Action Machine: Rethinking Action Recognition in Trimmed Videos | - |