Date

a year ago

Size

9.3 GB

Features and advantages:

Wide multi-language coverage: It includes 13 languages, covering multiple language families (such as Indo-European, Sino-Tibetan, Arabic, etc.).
Long document feature: The average length of a document is 4,737 words, which is suitable for long text processing needs in real scenarios.
Standardized construction: Generate high-quality queries through GPT-3.5 to ensure strong relevance of queries to document content.

MLDR.torrent

Seeding 1Downloading 0Completed 110Total Downloads 196

MLDR/
- README.md
  1.62 KB
- README.txt
  3.24 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

a year ago

Size

9.3 GB

Features and advantages:

Wide multi-language coverage: It includes 13 languages, covering multiple language families (such as Indo-European, Sino-Tibetan, Arabic, etc.).
Long document feature: The average length of a document is 4,737 words, which is suitable for long text processing needs in real scenarios.
Standardized construction: Generate high-quality queries through GPT-3.5 to ensure strong relevance of queries to document content.

MLDR.torrent

Seeding 1Downloading 0Completed 110Total Downloads 196

MLDR/
- README.md
  1.62 KB
- README.txt
  3.24 KB

MDPBench Multilingual Document Parsing Benchmark Dataset

2 months ago

LightOnOCR-mix-0126 Text Transcription Dataset

4 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

a year ago

Size

9.3 GB

Features and advantages:

Wide multi-language coverage: It includes 13 languages, covering multiple language families (such as Indo-European, Sino-Tibetan, Arabic, etc.).
Long document feature: The average length of a document is 4,737 words, which is suitable for long text processing needs in real scenarios.
Standardized construction: Generate high-quality queries through GPT-3.5 to ensure strong relevance of queries to document content.

MLDR.torrent

Seeding 1Downloading 0Completed 110Total Downloads 196

MLDR/
- README.md
  1.62 KB
- README.txt
  3.24 KB

MDPBench Multilingual Document Parsing Benchmark Dataset

2 months ago

LightOnOCR-mix-0126 Text Transcription Dataset

4 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

MLDR Multilingual Document Retrieval Dataset | Datasets | HyperAI

Command Palette

MLDR Multilingual Document Retrieval Dataset

Features and advantages:

Build AI with AI

HyperAI Newsletters

Command Palette

MLDR Multilingual Document Retrieval Dataset

Features and advantages:

MDPBench Multilingual Document Parsing Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

MLDR Multilingual Document Retrieval Dataset

Features and advantages:

MDPBench Multilingual Document Parsing Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Build AI with AI

HyperAI Newsletters

MDPBench Multilingual Document Parsing Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

MDPBench Multilingual Document Parsing Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset