SL-DML: Signal Level Deep Metric Learning for Multimodal One-Shot Action Recognition

Memmesheimer Raphael ; Theisen Nick ; Paulus Dietrich

Abstract

Recognizing an activity with a single reference sample using metric learning approaches is a promising research field. The majority of few-shot methods focus on object recognition or face identification. We propose a metric learning approach to reduce the action recognition problem to a nearest neighbor search in embedding space. We encode signals into images and extract features using a deep residual CNN. Using triplet loss, we learn a feature embedding. The resulting encoder transforms features into an embedding space in which smaller distances encode similar actions while larger distances encode different actions. Our approach is based on a signal level formulation and remains flexible across a variety of modalities. It further outperforms the baseline on the large-scale NTU RGB+D 120 dataset for the one-shot action recognition protocol by 5.6%. With just 60% of the training data, our approach still outperforms the baseline approach by 3.7%. With 40% of the training data, our approach performs comparably to the second-ranked approach. Further, we show that our approach generalizes well in experiments on the UTD-MHAD dataset for inertial, skeleton and fused data and the Simitate dataset for motion capture data. Furthermore, our inter-joint and inter-sensor experiments suggest good capabilities on previously unseen setups.
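
The two ideas the abstract combines, a triplet loss for training and a nearest neighbor search for one-shot classification, can be illustrated with a minimal sketch. This is not the authors' implementation: the signal-to-image encoding and the residual CNN are omitted, the 2-D embedding vectors are made up, and the Euclidean distance, the margin of 0.2 and the function names are assumptions chosen for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Standard triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the positive is closer to the anchor than the
    negative by at least `margin` (the margin value is an assumption).
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def one_shot_classify(query, references):
    """Return the label of the reference embedding closest to `query`.

    `references` maps each action label to the embedding of its single
    reference sample, so classification is a nearest neighbor search.
    """
    return min(references, key=lambda label: euclidean(query, references[label]))


# Hypothetical embeddings: one reference sample per action class.
references = {"wave": [0.0, 1.0], "kick": [1.0, 0.0]}
query = [0.1, 0.9]

print(one_shot_classify(query, references))
print(triplet_loss(references["wave"], query, references["kick"]))
```

In this toy setup the query lies next to the "wave" reference, so the nearest neighbor search selects that label, and the triplet built from the same three points already satisfies the margin, giving a loss of zero.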

Code Repositories

raphaelmemmesheimer/sl-dml (official, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
one-shot-3d-action-recognition-on-ntu-rgbd | Deep Metric Learning (Triplet Loss, Signals) | Accuracy: 49.6%
