Date

a year ago

Organization

Paper URL

Tags

Multimodal-Textbook-6.5M is a multimodal textbook dataset released by Alibaba DAMO Academy in 2025. It accompanies the paper "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining" and aims to improve multimodal pre-training, in particular a model's ability to handle interleaved visual and textual inputs. The dataset contains 6.5 million images and about 800 million text tokens, all extracted from online teaching videos totaling 22,000 class hours. It covers six basic subjects, including mathematics, physics, and chemistry, and provides more coherent context and richer knowledge for image-text alignment.

Figure: Example of building a dataset from a tutorial video

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Paper URL

arxiv.org

Multimodal-Textbook-6.5M Multimodal Textbook Dataset

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Nemotron Personas France (French Synthetic Personas Dataset)

Sutra 10B Pretraining Teaching and Training Dataset

Student Mental Health and Burnout Dataset

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

Pan-Cancer scRNA-Seq Cancer Single-Cell Transcriptional Atlas Dataset

THINGS-EEG EEG Dataset

THINGS-MEG Magnetoencephalography Dataset

THINGS-fMRI Functional Magnetic Resonance Imaging Dataset

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

RoVid-X Robot Video Generation Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

TransPhy3D Transparent Reflection Synthesis Video Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

TxT360-3efforts Multi-Task Inference Dataset
