HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Okan Köpüklü; Neslihan Köse; Gerhard Rigoll

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Abstract

Acquiring spatio-temporal states of an action is the most crucial step for action classification. In this paper, we propose a data level fusion strategy, Motion Fused Frames (MFFs), designed to fuse motion information into static images as better representatives of spatio-temporal states of an action. MFFs can be used as input to any deep learning architecture with very little modification on the network. We evaluate MFFs on hand gesture recognition tasks using three video datasets - Jester, ChaLearn LAP IsoGD and NVIDIA Dynamic Hand Gesture Datasets - which require capturing long-term temporal relations of hand movements. Our approach obtains very competitive performance on Jester and ChaLearn benchmarks with the classification accuracies of 96.28% and 57.4%, respectively, while achieving state-of-the-art performance with 84.7% accuracy on NVIDIA benchmark.

Code Repositories

okankop/MFF-pytorch
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
hand-gesture-recognition-on-chalean-test8-MFFs-3f1c
Accuracy: 56.7
hand-gesture-recognition-on-chalearn-val8-MFFs-3f1c (5 crop)
Accuracy: 57.4
hand-gesture-recognition-on-jester-testDRX3D
Top 1 Accuracy: 96.6
hand-gesture-recognition-on-jester-val8-MFFs-3f1c (5 crop)
Top 1 Accuracy: 96.33
Top 5 Accuracy: 99.86
hand-gesture-recognition-on-nvgesture-18-MFFs-3f1c
Accuracy: 84.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition | Papers | HyperAI