Date

8 months ago

Size

169.51 MB

Organization

Paper URL

2506.21875

License

CC BY 4.0

Tags

Text-to-Audio

WildSpeech-Bench is the first benchmark for evaluating the speech-to-speech capabilities of SpeechLLM, released by Tencent in 2025. The related paper results are "WildSpeech-Bench: Benchmarking End-to-End SpeechLLMs in the Wild", which aims to measure the model's ability to understand and generate complete speech input to speech output (Speech-to-Speech, S2S) in real voice interaction scenarios. The dataset contains 1,100 queries across five main categories: information queries, solution requests, opinion exchanges, text creation, and paralinguistic expressions. Each category corresponds to a common user intent. 1,000 of these queries are from general voice interaction scenarios (including information queries, solution requests, opinion exchanges, and text creation), while another 100 are characterized by paralinguistic features such as pauses, intonation, stuttering, and near-phonetic word recognition. Each query is accompanied by diverse speech output examples, encompassing a wide range of speaker attributes (gender, age, voice variants), acoustic conditions, and noise environment settings, to more realistically simulate the diversity and challenges of natural voice interaction.

WildSpeech-Bench.torrent

Seeding 1Downloading 1Completed 0Total Downloads 84

WildSpeech-Bench/
- README.md
  1.83 KB
- README.txt
  3.66 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

8 months ago

Size

169.51 MB

Organization

Paper URL

2506.21875

License

CC BY 4.0

Related Datasets

Sutra 10B Pretraining Teaching and Training Dataset

2 months ago

CL-bench Context Learning Evaluation Benchmark Dataset

4 months ago

DeepPlanning Long-Term Planning Capability Assessment Dataset

4 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

5 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

5 months ago

TxT360-3efforts Multi-Task Inference Dataset

5 months ago

X-ray Contraband Detection Dataset

5 months ago

LongBench-Pro Long Context Comprehensive Evaluation Dataset

5 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

WildSpeech-Bench Speech Understanding Generation Benchmark Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

WildSpeech-Bench Speech Understanding Generation Benchmark Dataset

Related Datasets

Sutra 10B Pretraining Teaching and Training Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

DeepPlanning Long-Term Planning Capability Assessment Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

WildSpeech-Bench Speech Understanding Generation Benchmark Dataset

Related Datasets

Sutra 10B Pretraining Teaching and Training Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

DeepPlanning Long-Term Planning Capability Assessment Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Sutra 10B Pretraining Teaching and Training Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

DeepPlanning Long-Term Planning Capability Assessment Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Related Datasets

Sutra 10B Pretraining Teaching and Training Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

DeepPlanning Long-Term Planning Capability Assessment Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

TxT360-3efforts Multi-Task Inference Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset