NonverbalTTS non-verbal Audio Generation Dataset

NonverbalTTS is a non-verbal audio generation dataset released by VK Lab and Yandex in 2025. The accompanying paper is "NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech". The dataset aims to advance expressive text-to-speech (TTS) research and to help models generate natural speech that carries emotion and non-verbal sounds.

The dataset contains 17 hours of high-quality speech from 2,296 participants (60% male, 40% female). It covers 10 types of non-verbal vocalization (breathing, laughing, sighing, sneezing, coughing, throat clearing, groaning, grunting, snoring, and inhaling) and 8 emotion categories (anger, disgust, fear, happiness, neutral, sadness, surprise, and other).

Dataset features:

  • Multi-source data: derived from the VoxCeleb and Expresso corpora
  • Rich metadata: emotion tags, non-verbal vocalization annotations, speaker IDs, audio quality metrics
  • Sampling rate: 16 kHz for audio from VoxCeleb, 48 kHz for audio from Expresso
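
Because the two source corpora use different sampling rates, a training pipeline usually has to bring all clips to one common rate first. The following is a minimal sketch of that step, assuming a 16 kHz target and a per-clip source label. The names `SOURCE_SAMPLE_RATE_HZ`, `decimation_factor`, and `downsample_by_averaging` are hypothetical and not part of the dataset, and block averaging is only a crude stand-in for a proper resampler.

```python
# Source-to-rate mapping taken from the feature list above.
SOURCE_SAMPLE_RATE_HZ = {"VoxCeleb": 16_000, "Expresso": 48_000}
TARGET_RATE_HZ = 16_000  # assumed common rate, not prescribed by the dataset


def decimation_factor(source: str, target_hz: int = TARGET_RATE_HZ) -> int:
    """Return the integer factor needed to bring a source down to target_hz."""
    rate = SOURCE_SAMPLE_RATE_HZ[source]
    if rate % target_hz != 0:
        raise ValueError(f"{rate} Hz is not an integer multiple of {target_hz} Hz")
    return rate // target_hz


def downsample_by_averaging(samples: list[float], factor: int) -> list[float]:
    """Crude decimation: average each block of `factor` samples.

    Averaging acts as a rough low-pass filter before the rate reduction.
    A real pipeline would use a dedicated resampling routine instead.
    """
    if factor == 1:
        return list(samples)
    usable = len(samples) - len(samples) % factor  # drop the ragged tail
    return [
        sum(samples[i:i + factor]) / factor
        for i in range(0, usable, factor)
    ]


# Toy 48 kHz clip (made-up values, not real dataset audio).
clip_48k = [0.0, 0.3, 0.6, 0.9, 0.6, 0.3, 0.1]
factor = decimation_factor("Expresso")  # 48 kHz -> 16 kHz
clip_16k = downsample_by_averaging(clip_48k, factor)
print(factor, len(clip_16k))
```

VoxCeleb clips are already at the assumed target rate, so their factor is 1 and they pass through unchanged. Only the Expresso clips are reduced.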
NonverbalTTS.torrent
  • NonverbalTTS/
    • README.md (1.77 KB)
    • README.txt (3.55 KB)
    • data/
      • NonverbalTTS.zip (3.06 GB)
