HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

SlowFast Network for Continuous Sign Language Recognition

Junseok Ahn Youngjoon Jang Joon Son Chung

SlowFast Network for Continuous Sign Language Recognition

Abstract

The objective of this work is the effective extraction of spatial and dynamic features for Continuous Sign Language Recognition (CSLR). To accomplish this, we utilise a two-pathway SlowFast network, where each pathway operates at distinct temporal resolutions to separately capture spatial (hand shapes, facial expressions) and dynamic (movements) information. In addition, we introduce two distinct feature fusion methods, carefully designed for the characteristics of CSLR: (1) Bi-directional Feature Fusion (BFF), which facilitates the transfer of dynamic semantics into spatial semantics and vice versa; and (2) Pathway Feature Enhancement (PFE), which enriches dynamic and spatial representations through auxiliary subnetworks, while avoiding the need for extra inference time. As a result, our model further strengthens spatial and dynamic representations in parallel. We demonstrate that the proposed framework outperforms the current state-of-the-art performance on popular CSLR datasets, including PHOENIX14, PHOENIX14-T, and CSL-Daily.

Code Repositories

kaistmm/SlowFastSign
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
sign-language-recognition-on-csl-dailySlowFastSign
Word Error Rate (WER): 24.9
sign-language-recognition-on-rwth-phoenixSlowFastSign
Word Error Rate (WER): 18.3
sign-language-recognition-on-rwth-phoenix-1SlowFastSign
Word Error Rate (WER): 18.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
SlowFast Network for Continuous Sign Language Recognition | Papers | HyperAI