HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language

Turi Abu Ying Shi Thomas Fang Zheng Dong Wang

Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language

Abstract

We present a novel Automatic Speech Recognition (ASR) dataset for the Oromo language, a widely spoken language in Ethiopia and neighboring regions. The dataset was collected through a crowd-sourcing initiative, encompassing a diverse range of speakers and phonetic variations. It consists of 100 hours of real-world audio recordings paired with transcriptions, covering read speech in both clean and noisy environments. This dataset addresses the critical need for ASR resources for the Oromo language which is underrepresented. To show its applicability for the ASR task, we conducted experiments using the Conformer model, achieving a Word Error Rate (WER) of 15.32% with hybrid CTC and AED loss and WER of 18.74% with pure CTC loss. Additionally, fine-tuning the Whisper model resulted in a significantly improved WER of 10.82%. These results establish baselines for Oromo ASR, highlighting both the challenges and the potential for improving ASR performance in Oromo. The dataset is publicly available at https://github.com/turinaf/sagalee and we encourage its use for further research and development in Oromo speech processing.

Code Repositories

turinaf/sagalee
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
automatic-speech-recognition-asr-on-sagaleeConformer
Test WER: 15.32
automatic-speech-recognition-asr-on-sagaleeWhisper-largev3-finetuned
Test WER: 10.82

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language | Papers | HyperAI