Date

4 years ago

Organization

Publish URL

srvk.github.io

Paper URL

arxiv.org

License

CC BY-SA 4.0

Tags

Multimodal

Natural Language Processing

Semantic Segmentation

Video Generation

Video Understanding

How2 is a multilingual video dataset containing 13,500 videos and 300 hours of speech, all with English subtitles and Portuguese translations. It is split into 185,187 segments for training, 2,022 for development (dev), and 2,361 for testing. The dataset can be used to study multimodal language understanding.
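As a quick sanity check on the figures quoted above, the sketch below computes each split's share of the total and the rough average video length. It uses only the numbers stated in the description; the names `SPLITS`, `split_fractions`, and `average_video_seconds` are illustrative and not part of any official How2 tooling, and the average is approximate because the video and hour counts are rounded.

```python
# Split sizes exactly as quoted in the dataset description.
SPLITS = {"train": 185_187, "dev": 2_022, "test": 2_361}

# Rounded totals quoted in the description (assumed approximate).
NUM_VIDEOS = 13_500
TOTAL_HOURS = 300


def split_fractions(splits):
    """Return each split's share of the combined item count."""
    total = sum(splits.values())
    return {name: count / total for name, count in splits.items()}


def average_video_seconds(total_hours, num_videos):
    """Rough mean video length in seconds, from the rounded totals."""
    return total_hours * 3600 / num_videos


if __name__ == "__main__":
    for name, frac in split_fractions(SPLITS).items():
        print(f"{name}: {SPLITS[name]:,} items ({frac:.2%})")
    print(f"total: {sum(SPLITS.values()):,} items")
    print(f"average video length: ~{average_video_seconds(TOTAL_HOURS, NUM_VIDEOS):.0f} s")
```

The dev and test splits together make up only a little over 2% of the items, so nearly all of the data is available for training.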

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

RoVid-X Robot Video Generation Dataset

2 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

5 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

5 months ago

LongBench-Pro Long Context Comprehensive Evaluation Dataset

5 months ago

How2 Multilingual Video Dataset
