3 months ago

Libri-Light: A Benchmark for ASR with Limited or No Supervision

Abstract

We introduce a new collection of spoken English audio suitable for training speech recognition systems under limited or no supervision. It is derived from open-source audiobooks from the LibriVox project. It contains over 60K hours of audio, which is, to our knowledge, the largest freely available corpus of speech. The audio has been segmented using voice activity detection and is tagged with SNR, speaker ID and genre descriptions. Additionally, we provide baseline systems and evaluation metrics working under three settings: (1) the zero resource/unsupervised setting (ABX), (2) the semi-supervised setting (PER, CER) and (3) the distant supervision setting (WER). Settings (2) and (3) use limited textual resources (10 minutes to 10 hours) aligned with the speech. Setting (3) additionally uses large amounts of unaligned text. All systems are evaluated on the standard LibriSpeech dev and test sets for comparison with the supervised state-of-the-art.
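The error rates named in the abstract (PER, CER, WER) are all edit-distance ratios computed over different token units. The following is a minimal sketch of that idea, not the benchmark's official scoring code. The function names (`edit_distance`, `error_rate`, `wer`, `cer`) are illustrative, and the tokenization choices (whitespace splitting for words, every character including spaces for CER) are assumptions; official tooling may normalize text differently.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(refs, hyps, tokenize):
    """Corpus-level error rate in percent: total edits / total reference tokens."""
    errors = 0
    total = 0
    for ref, hyp in zip(refs, hyps):
        ref_tokens = tokenize(ref)
        hyp_tokens = tokenize(hyp)
        errors += edit_distance(ref_tokens, hyp_tokens)
        total += len(ref_tokens)
    if total == 0:
        raise ValueError("references contain no tokens")
    return 100.0 * errors / total


def wer(refs, hyps):
    # Assumption: words are separated by whitespace.
    return error_rate(refs, hyps, str.split)


def cer(refs, hyps):
    # Assumption: every character, including spaces, counts as a token.
    return error_rate(refs, hyps, list)
```

For example, `wer(["the cat sat"], ["the cat sit"])` counts one substitution against three reference words, giving roughly 33.3. A phoneme error rate follows the same pattern with a tokenizer that returns phoneme symbols.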

Code Repositories

facebookresearch/libri-light (Official, pytorch, mentioned in GitHub)
k2-fsa/libriheavy (mentioned in GitHub)

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| speech-recognition-on-libri-light-test-clean | CPC unlab-60k | ABX-across: 7.56; ABX-within: 5.83 |
| speech-recognition-on-libri-light-test-clean | CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM | Word Error Rate (WER): 43.9 |
| speech-recognition-on-libri-light-test-clean | TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM | Word Error Rate (WER): 29.3 |
| speech-recognition-on-libri-light-test-other | CPC unlab-60k | ABX-across: 13.42; ABX-within: 8.14 |
| speech-recognition-on-libri-light-test-other | CPC unlab-60k+train-10h CPC pretrain + CTC fine-tuning + 4gram-LM | Word Error Rate (WER): 69.5 |
| speech-recognition-on-libri-light-test-other | TDS 60k pseudo-label + CTC fine-tuning + 4gram-LM | Word Error Rate (WER): 56.6 |
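
The ABX-within and ABX-across figures are discrimination error rates: for a triplet (A, B, X) where A and X share a sound category and B does not, an error is counted when X lies closer to B than to A. The sketch below illustrates that decision rule under strong simplifying assumptions. Each token is represented by a single fixed-length vector compared with cosine distance, whereas evaluations of this kind typically compare frame sequences, and the score is a flat average over all triplets. The names `cosine_distance` and `abx_error`, and the `(vector, category, speaker)` item layout, are hypothetical.

```python
import math
from itertools import permutations


def cosine_distance(u, v):
    """1 minus the cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def abx_error(items, mode="within"):
    """ABX error in percent over items of the form (vector, category, speaker).

    mode="within": A, B and X all come from the same speaker.
    mode="across": A and B share a speaker, X comes from a different one.
    """
    if mode not in ("within", "across"):
        raise ValueError("mode must be 'within' or 'across'")
    errors = 0.0
    trials = 0
    for a, b, x in permutations(items, 3):
        (vec_a, cat_a, spk_a), (vec_b, cat_b, spk_b), (vec_x, cat_x, spk_x) = a, b, x
        # A and X must share a category; B must belong to a different one.
        if cat_a != cat_x or cat_b == cat_a:
            continue
        # A and B always come from the same speaker.
        if spk_a != spk_b:
            continue
        if mode == "within" and spk_x != spk_a:
            continue
        if mode == "across" and spk_x == spk_a:
            continue
        d_ax = cosine_distance(vec_a, vec_x)
        d_bx = cosine_distance(vec_b, vec_x)
        if d_ax > d_bx:
            errors += 1.0   # X was judged closer to the wrong category
        elif d_ax == d_bx:
            errors += 0.5   # ties count as chance
        trials += 1
    if trials == 0:
        raise ValueError("no valid ABX triplets for this mode")
    return 100.0 * errors / trials
```

Lower is better: a representation that perfectly separates categories scores 0, and one that carries no category information scores around 50.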
