Date

3 months ago

Organization

License

CC BY 4.0

Data fields:

utt_id: A string representing a unique identifier for the recording.
waveform: Audio type, sampling rate 16,000.
locale: A string representing the recording region.
speaker_id: A string representing a unique identifier for the speaker.
speaker_age: A 32-bit integer representing the speaker's age.
speaker_gender: A string representing the speaker's gender.
environment: A string representing the recording environment.
text: A string type representing the recorded text content.
topk_salient_terms: A list of strings representing keywords.
topk_salient_terms_timestamps: A list of floating-point numbers representing the timestamps of the keywords.

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset Discuss on Discord

Date

3 months ago

Organization

License

CC BY 4.0

Data fields:

utt_id: A string representing a unique identifier for the recording.
waveform: Audio type, sampling rate 16,000.
locale: A string representing the recording region.
speaker_id: A string representing a unique identifier for the speaker.
speaker_age: A 32-bit integer representing the speaker's age.
speaker_gender: A string representing the speaker's gender.
environment: A string representing the recording environment.
text: A string type representing the recorded text content.
topk_salient_terms: A list of strings representing keywords.
topk_salient_terms_timestamps: A list of floating-point numbers representing the timestamps of the keywords.

MDPBench Multilingual Document Parsing Benchmark Dataset

a month ago

Spam Email Detection Dataset

3 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset Discuss on Discord

Date

3 months ago

Organization

License

CC BY 4.0

Data fields:

utt_id: A string representing a unique identifier for the recording.
waveform: Audio type, sampling rate 16,000.
locale: A string representing the recording region.
speaker_id: A string representing a unique identifier for the speaker.
speaker_age: A 32-bit integer representing the speaker's age.
speaker_gender: A string representing the speaker's gender.
environment: A string representing the recording environment.
text: A string type representing the recorded text content.
topk_salient_terms: A list of strings representing keywords.
topk_salient_terms_timestamps: A list of floating-point numbers representing the timestamps of the keywords.

MDPBench Multilingual Document Parsing Benchmark Dataset

a month ago

Spam Email Detection Dataset

3 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Simple Voice Questions Dataset | Datasets | HyperAI

Command Palette

Simple Voice Questions Dataset

Data fields:

Build AI with AI

HyperAI Newsletters

Command Palette

Simple Voice Questions Dataset

Data fields:

MDPBench Multilingual Document Parsing Benchmark Dataset

Spam Email Detection Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

Simple Voice Questions Dataset

Data fields:

MDPBench Multilingual Document Parsing Benchmark Dataset

Spam Email Detection Dataset

Build AI with AI

HyperAI Newsletters

MDPBench Multilingual Document Parsing Benchmark Dataset

Spam Email Detection Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset

Spam Email Detection Dataset