HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

Yujie Zhou; Wenwen Qiang; Anyi Rao; Ning Lin; Bing Su; Jiaqi Wang

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization

Abstract

Zero-shot skeleton-based action recognition aims to recognize actions of unseen categories after training on data of seen categories. The key is to build the connection between visual and semantic space from seen to unseen classes. Previous studies have primarily focused on encoding sequences into a singular feature vector, with subsequent mapping the features to an identical anchor point within the embedded space. Their performance is hindered by 1) the ignorance of the global visual/semantic distribution alignment, which results in a limitation to capture the true interdependence between the two spaces. 2) the negligence of temporal information since the frame-wise features with rich action clues are directly pooled into a single feature vector. We propose a new zero-shot skeleton-based action recognition method via mutual information (MI) estimation and maximization. Specifically, 1) we maximize the MI between visual and semantic space for distribution alignment; 2) we leverage the temporal information for estimating the MI by encouraging MI to increase as more frames are observed. Extensive experiments on three large-scale skeleton action datasets confirm the effectiveness of our method. Code: https://github.com/YujieOuO/SMIE.

Code Repositories

yujieouo/smie
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
zero-shot-skeletal-action-recognition-on-ntuSMIE
Accuracy (12 unseen classes): 40.18
Accuracy (5 unseen classes): 77.98
Random Split Accuracy: 65.08
zero-shot-skeletal-action-recognition-on-ntu-1SMIE
Accuracy (10 unseen classes): 65.74
Accuracy (24 unseen classes): 45.30
Random Split Accuracy: 46.40
zero-shot-skeletal-action-recognition-on-pkuSMIE
Random Split Accuracy: 60.83

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization | Papers | HyperAI