HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Better Sign Language Translation with STMC-Transformer

Kayo Yin Jesse Read

Better Sign Language Translation with STMC-Transformer

Abstract

Sign Language Translation (SLT) first uses a Sign Language Recognition (SLR) system to extract sign language glosses from videos. Then, a translation system generates spoken language translations from the sign language glosses. This paper focuses on the translation system and introduces the STMC-Transformer which improves on the current state-of-the-art by over 5 and 7 BLEU respectively on gloss-to-text and video-to-text translation of the PHOENIX-Weather 2014T dataset. On the ASLG-PC12 corpus, we report an increase of over 16 BLEU. We also demonstrate the problem in current methods that rely on gloss supervision. The video-to-text translation of our STMC-Transformer outperforms translation of GT glosses. This contradicts previous claims that GT gloss translation acts as an upper bound for SLT performance and reveals that glosses are an inefficient representation of sign language. For future SLT research, we therefore suggest an end-to-end training of the recognition and translation models, or using a different sign language annotation scheme.

Code Repositories

kayoyin/transformer-slt
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
sign-language-translation-on-aslg-pc12-1Transformer Ens.
BLEU-4: 82.87
sign-language-translation-on-rwth-phoenixSTMC+Transformer (Ens)
BLEU-4: 25.40

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Better Sign Language Translation with STMC-Transformer | Papers | HyperAI