Date

2 years ago

Size

2.41 MB

Organization

Publish URL

omni-math.github.io

Paper URL

arxiv.org

* This dataset supports online use.Click here to jump.

Omni-MATH is an Olympic-level mathematical reasoning benchmark dataset created by Peking University and Alibaba, which aims to evaluate the performance of large language models (LLMs) on Olympic-level mathematical problems.Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models". This dataset contains 4,428 rigorously manually annotated competition-level math problems, covering 33 subfields and more than 10 different difficulty levels, from the Olympiad preparatory level to top Olympiad mathematics competitions such as IMO (International Mathematical Olympiad), IMC (International Mathematical Contest) and Putnam Mathematics Competition. The creation process of Omni-MATH includes collecting data from global mathematics competitions and verifying it through manual annotation to ensure the high quality and diversity of the data. During the construction of the dataset, the research team used GPT-4o to classify the questions and divide them into different mathematical fields to evaluate the performance of the model in different mathematical fields.

Omni-MATH.torrent

Seeding 1Downloading 0Completed 152Total Downloads 251

Omni-MATH/
- README.md
  1.73 KB
- README.txt
  3.46 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Use this Dataset

Discuss on Discord

Date

2 years ago

Size

2.41 MB

Organization

Publish URL

omni-math.github.io

Paper URL

arxiv.org

* This dataset supports online use.Click here to jump.

Omni-MATH.torrent

Seeding 1Downloading 0Completed 152Total Downloads 251

Omni-MATH/
- README.md
  1.73 KB
- README.txt
  3.46 KB

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

2 months ago

Sutra 10B Pretraining Teaching and Training Dataset

2 months ago

Groundsource Global Flood Events Dataset

3 months ago

CHIMERA General Inference Synthetic Dataset

3 months ago

CL-bench Context Learning Evaluation Benchmark Dataset

4 months ago

LightOnOCR-mix-0126 Text Transcription Dataset

4 months ago

Nemotron-Math-v2 Mathematical Inference Dataset

5 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

5 months ago

LongBench-Pro Long Context Comprehensive Evaluation Dataset

5 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Omni-MATH Mathematical Reasoning Benchmark Dataset

* This dataset supports online use.Click here to jump.

Build AI with AI

HyperAI Newsletters

Command Palette

Omni-MATH Mathematical Reasoning Benchmark Dataset

* This dataset supports online use.Click here to jump.

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Sutra 10B Pretraining Teaching and Training Dataset

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

Omni-MATH Mathematical Reasoning Benchmark Dataset

* This dataset supports online use.Click here to jump.

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Sutra 10B Pretraining Teaching and Training Dataset

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Sutra 10B Pretraining Teaching and Training Dataset

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Related Datasets

DRACO Cross-Disciplinary Deep Research Benchmark Dataset

Sutra 10B Pretraining Teaching and Training Dataset

Groundsource Global Flood Events Dataset

CHIMERA General Inference Synthetic Dataset

CL-bench Context Learning Evaluation Benchmark Dataset

LightOnOCR-mix-0126 Text Transcription Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset