Librispeech Transducer Model with Internal Language Model Prior Correction

Albert Zeyer André Merboldt Wilfried Michel Ralf Schlüter Hermann Ney

Abstract

We present our transducer model on Librispeech. We study variants of including an external language model (LM) with shallow fusion and subtracting an estimated internal LM. This is justified by a Bayesian interpretation in which the transducer model prior is given by the estimated internal LM. Subtracting the internal LM gives us over 14% relative improvement over normal shallow fusion. Our transducer has a separate probability distribution for the non-blank labels, which allows for easier combination with the external LM and easier estimation of the internal LM. We additionally include the end-of-sentence (EOS) probability of the external LM in the last blank probability, which further improves performance. All our code and setups are published.
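
As a rough illustration of the score combination the abstract describes, the sketch below combines log-probabilities log-linearly: the transducer's non-blank label score, plus a scaled external LM score, minus a scaled internal LM estimate. The function names, the scale parameters `lm_scale` and `ilm_scale`, their default values, and the toy probabilities are all assumptions made for illustration and are not taken from the paper or its published code. The EOS handling is one plausible reading of "including the EOS probability of the external LM in the last blank probability".

```python
import math


def fused_label_score(log_p_label, log_p_ext_lm, log_p_int_lm,
                      lm_scale=0.5, ilm_scale=0.3):
    """Log-linear score for one non-blank label (hypothetical scales).

    log_p_label  : transducer log-probability of the label
    log_p_ext_lm : external LM log-probability of the label
    log_p_int_lm : estimated internal LM log-probability of the label
    Setting ilm_scale=0.0 reduces this to plain shallow fusion.
    """
    return log_p_label + lm_scale * log_p_ext_lm - ilm_scale * log_p_int_lm


def final_blank_score(log_p_blank, log_p_ext_lm_eos, lm_scale=0.5):
    """Score for the last blank, with the external LM's EOS term added.

    This is an assumed form: the external LM contributes its
    end-of-sentence log-probability only at the final blank.
    """
    return log_p_blank + lm_scale * log_p_ext_lm_eos


# Toy values, chosen only to show the effect of each term.
log_p_label = math.log(0.6)
log_p_ext = math.log(0.2)
log_p_int = math.log(0.4)

shallow = fused_label_score(log_p_label, log_p_ext, log_p_int, ilm_scale=0.0)
corrected = fused_label_score(log_p_label, log_p_ext, log_p_int)
last_blank = final_blank_score(math.log(0.9), math.log(0.5))

print(f"shallow fusion score:      {shallow:.4f}")
print(f"with internal LM removed:  {corrected:.4f}")
print(f"final blank with EOS term: {last_blank:.4f}")
```

In a real decoder these scores would be accumulated per hypothesis during beam search, and the scales would be tuned on a development set.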

Benchmarks

Benchmark | Methodology | Metrics
speech-recognition-on-librispeech-test-clean | LSTM Transducer | Word Error Rate (WER): 2.23
speech-recognition-on-librispeech-test-other | LSTM Transducer | Word Error Rate (WER): 5.6
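
For readers unfamiliar with the metric reported above, the following is a minimal sketch of how a word error rate can be computed for a single utterance, as the word-level edit distance divided by the number of reference words. The function name and the example sentences are hypothetical. Benchmark figures are normally computed over a whole test set (total errors divided by total reference words) and may involve text normalization not shown here.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the WER in percent for one reference/hypothesis pair."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# Hypothetical example: one substitution and one deletion over six words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER: {wer:.2f}%")
```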