HyperAIHyperAI

Command Palette

Search for a command to run...

AISHELL-1 Open Source Chinese Speech Database

Date

2 years ago

Size

14.52 GB

Organization

AISHELL

Paper URL

arxiv.org

The Hillshell Chinese Mandarin Open Source Speech Database AISHELL-ASR0009-OS1 has a recording time of 178 hours and is part of the Hillshell Chinese Mandarin Speech Database AISHELL-ASR0009.

The AISHELL-ASR0009 recording text involves 11 fields such as smart home, unmanned driving, and industrial production. The recording process was conducted in a quiet indoor environment, using 3 different devices at the same time: a high-fidelity microphone (44.1kHz, 16-bit); an Android phone (16kHz, 16-bit); and an iOS phone (16kHz, 16-bit). The audio recorded by the high-fidelity microphone was downsampled to 16kHz and used to produce AISHELL-ASR0009-OS1. 400 speakers from different accent areas in China participated in the recording. After being transcribed and annotated by professional voice proofreaders and passing strict quality inspections, the text accuracy of this database is above 95%. It is divided into training set, development set, and test set.

AISHELL-1.torrent
Seeding 2Downloading 0Completed 317Total Downloads 751
  • AISHELL-1/
    • README.md
      1.5 KB
    • README.txt
      3 KB
      • data/
        • AISHELL-1.zip
          14.52 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
AISHELL-1 Open Source Chinese Speech Database | Datasets | HyperAI