HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Automatic Speech Recognition in German: A Detailed Error Analysis

{René Peinl Johannes Wirth}

Automatic Speech Recognition in German: A Detailed Error Analysis

Abstract

The amount of freely available systems for automatic speech recognition (ASR) based on neural networks is growing steadily, with equally increasingly reliable predictions. However, the evaluation of trained models is typically exclusively based on statistical metrics such as WER or CER, which do not provide any insight into the nature or impact of the errors produced when predicting transcripts from speech input. This work presents a selection of ASR model architectures that are pretrained on the German language and evaluates them on a benchmark of diverse test datasets. It identifies cross-architectural prediction errors, classifies those into categories and traces the sources of errors per category back into training data as well as other sources. Finally, it discusses solutions in order to create qualitatively better training datasets and more robust ASR systems.

Benchmarks

BenchmarkMethodologyMetrics
automatic-speech-recognition-on-huiConformer Transducer
WER (%): 1.89%
automatic-speech-recognition-on-m-ailabsConformer Transducer
WER (%): 4.28%
automatic-speech-recognition-on-the-spokenConformer Transducer
WER (%): 8.04%
automatic-speech-recognition-on-voxforgeConformer Transducer
WER (%): 3.36%
automatic-speech-recognition-on-voxpopuliConformer Transducer (German)
WER (%): 8.98%
speech-recognition-on-common-voice-germanConformer Transducer (no LM)
Test WER: 6.28%
speech-recognition-on-tudaConformer-Transducer (no LM)
Test WER: 5.82%

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Automatic Speech Recognition in German: A Detailed Error Analysis | Papers | HyperAI