Date

10 months ago

Size

3.06 GB

Paper URL

arxiv.org

License

Apache 2.0

Tags

Audio Classification

Text-to-Audio

NonverbalTTS is a non-verbal audio generation dataset released by VK Lab and Yandex in 2025. The related paper results are "NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech", which aims to promote expressive text-to-audio (TTS) research and support models to generate natural speech that contains emotions and non-verbal sounds. The dataset contains 17 hours of high-quality speech data from 2,296 participants (60% males, 40% females), covering 10 non-verbal speech types (breathing, laughing, sighing, sneezing, coughing, throat clearing, groaning, grunting, snoring, and inhaling) and 8 emotion categories (anger, disgust, fear, happiness, neutral, sadness, surprise, and other).

Dataset features:

Multi-source data: derived from VoxCeleb and Expresso corpora
Rich metadata: emotion tags, non-verbal speech annotations, speaker IDs, audio quality metrics
Sampling rate: 16kHz for audio from VoxCeleb, 48kHz for audio from Expresso

NonverbalTTS.torrent

Seeding 1Downloading 0Completed 44Total Downloads 152

NonverbalTTS/
- README.md
  1.77 KB
- README.txt
  3.55 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

10 months ago

Size

3.06 GB

Paper URL

arxiv.org

License

Apache 2.0

Dataset features:

Multi-source data: derived from VoxCeleb and Expresso corpora
Rich metadata: emotion tags, non-verbal speech annotations, speaker IDs, audio quality metrics
Sampling rate: 16kHz for audio from VoxCeleb, 48kHz for audio from Expresso

NonverbalTTS.torrent

Seeding 1Downloading 0Completed 44Total Downloads 152

NonverbalTTS/
- README.md
  1.77 KB
- README.txt
  3.55 KB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

10 months ago

Size

3.06 GB

Paper URL

arxiv.org

License

Apache 2.0

Dataset features:

Multi-source data: derived from VoxCeleb and Expresso corpora
Rich metadata: emotion tags, non-verbal speech annotations, speaker IDs, audio quality metrics
Sampling rate: 16kHz for audio from VoxCeleb, 48kHz for audio from Expresso

NonverbalTTS.torrent

Seeding 1Downloading 0Completed 44Total Downloads 152

NonverbalTTS/
- README.md
  1.77 KB
- README.txt
  3.55 KB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

NonverbalTTS non-verbal Audio Generation Dataset | Datasets | HyperAI

Command Palette

NonverbalTTS non-verbal Audio Generation Dataset

Dataset features:

Build AI with AI

HyperAI Newsletters

Command Palette

NonverbalTTS non-verbal Audio Generation Dataset

Dataset features:

Build AI with AI

HyperAI Newsletters

Command Palette

NonverbalTTS non-verbal Audio Generation Dataset

Dataset features:

Build AI with AI

HyperAI Newsletters