HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Graph Contrastive Learning for Skeleton-based Action Recognition

Xiaohu Huang Hao Zhou Jian Wang Haocheng Feng Junyu Han Errui Ding Jingdong Wang Xinggang Wang Wenyu Liu Bin Feng

Graph Contrastive Learning for Skeleton-based Action Recognition

Abstract

In the field of skeleton-based action recognition, current top-performing graph convolutional networks (GCNs) exploit intra-sequence context to construct adaptive graphs for feature aggregation. However, we argue that such context is still \textit{local} since the rich cross-sequence relations have not been explicitly investigated. In this paper, we propose a graph contrastive learning framework for skeleton-based action recognition (\textit{SkeletonGCL}) to explore the \textit{global} context across all sequences. In specific, SkeletonGCL associates graph learning across sequences by enforcing graphs to be class-discriminative, \emph{i.e.,} intra-class compact and inter-class dispersed, which improves the GCN capacity to distinguish various action patterns. Besides, two memory banks are designed to enrich cross-sequence context from two complementary levels, \emph{i.e.,} instance and semantic levels, enabling graph contrastive learning in multiple context scales. Consequently, SkeletonGCL establishes a new training paradigm, and it can be seamlessly incorporated into current GCNs. Without loss of generality, we combine SkeletonGCL with three GCNs (2S-ACGN, CTR-GCN, and InfoGCN), and achieve consistent improvements on NTU60, NTU120, and NW-UCLA benchmarks. The source code will be available at \url{https://github.com/OliverHxh/SkeletonGCL}.

Code Repositories

oliverhxh/skeletongcl
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
skeleton-based-action-recognition-on-ntu-rgbdSkeletonGCL (based on CTR-GCN)
Accuracy (CS): 93.1
Accuracy (CV): 97.0
Ensembled Modalities: 4
skeleton-based-action-recognition-on-ntu-rgbd-1SkeletonGCL (based on CTR-GCN)
Accuracy (Cross-Setup): 91.0
Accuracy (Cross-Subject): 89.5
Ensembled Modalities: 4

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Graph Contrastive Learning for Skeleton-based Action Recognition | Papers | HyperAI