HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Gimme Signals: Discriminative signal encoding for multimodal activity recognition

Raphael Memmesheimer; Nick Theisen; Dietrich Paulus

Gimme Signals: Discriminative signal encoding for multimodal activity recognition

Abstract

We present a simple, yet effective and flexible method for action recognition supporting multiple sensor modalities. Multivariate signal sequences are encoded in an image and are then classified using a recently proposed EfficientNet CNN architecture. Our focus was to find an approach that generalizes well across different sensor modalities without specific adaptions while still achieving good results. We apply our method to 4 action recognition datasets containing skeleton sequences, inertial and motion capturing measurements as well as \wifi fingerprints that range up to 120 action classes. Our method defines the current best CNN-based approach on the NTU RGB+D 120 dataset, lifts the state of the art on the ARIL Wi-Fi dataset by +6.78%, improves the UTD-MHAD inertial baseline by +14.4%, the UTD-MHAD skeleton baseline by 1.13% and achieves 96.11% on the Simitate motion capturing data (80/20 split). We further demonstrate experiments on both, modality fusion on a signal level and signal reduction to prevent the representation from overloading.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
action-recognition-in-videos-on-ntu-rgbd-120Gimme Signals (AIS)
Accuracy (Cross-Setup): 70.8
Accuracy (Cross-Subject): 71.59
multimodal-activity-recognition-on-utd-mhadGimme Signals (Skeleton, AIS)
Accuracy (CS): 93.33
skeleton-based-action-recognition-on-ntu-rgbd-1Gimme Signals (Skeleton, AIS)
Accuracy (Cross-Setup): 71.6%
Accuracy (Cross-Subject): 70.8%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Gimme Signals: Discriminative signal encoding for multimodal activity recognition | Papers | HyperAI