Date: a year ago
Size: 465.89 MB
Organization:
Paper URL: arxiv.org
Tags: Multimodal

VRC-Bench is presented by its authors as the first benchmark designed specifically for multimodal step-by-step reasoning tasks. It aims to evaluate model performance comprehensively in complex reasoning scenarios. It was released in 2025 by Mohamed bin Zayed University of Artificial Intelligence, University of Central Florida, Linköping University and Australian National University, alongside the paper "LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs".

Unlike traditional benchmarks that score only the accuracy of the final answer, VRC-Bench evaluates the quality of each reasoning step, giving a finer-grained assessment of model capabilities. The dataset covers challenges in eight different fields, including visual reasoning, mathematical and logical reasoning, scientific reasoning, and cultural and social understanding. The tasks involve scenarios such as complex visual perception, scientific reasoning and medical image interpretation, and contain more than 4k manually verified reasoning steps, so that both the accuracy and the logical coherence of a model's multi-step reasoning can be assessed.
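To make the contrast between final-answer accuracy and step-level evaluation concrete, here is a minimal Python sketch. It is a toy illustration and not VRC-Bench's actual metric: the `Sample` class and the functions `final_answer_accuracy` and `step_score` are hypothetical names, and exact string matching stands in for the semantic comparison of steps that a real evaluator would need.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Sample:
    """One hypothetical benchmark item with reference and predicted reasoning."""
    reference_steps: List[str]
    predicted_steps: List[str]
    reference_answer: str
    predicted_answer: str


def _norm(text: str) -> str:
    """Normalize text so that case and surrounding whitespace are ignored."""
    return text.strip().lower()


def final_answer_accuracy(samples: List[Sample]) -> float:
    """Fraction of samples whose final answer matches the reference."""
    if not samples:
        return 0.0
    correct = sum(
        _norm(s.predicted_answer) == _norm(s.reference_answer) for s in samples
    )
    return correct / len(samples)


def step_score(sample: Sample) -> float:
    """Toy step-level score: share of reference steps found among predicted steps.

    Assumption: exact matching after normalization replaces the semantic
    comparison a real step-level evaluator would perform.
    """
    if not sample.reference_steps:
        return 0.0
    predicted = {_norm(p) for p in sample.predicted_steps}
    matched = sum(_norm(r) in predicted for r in sample.reference_steps)
    return matched / len(sample.reference_steps)


# A model can reach the right answer while skipping or replacing a reasoning step.
example = Sample(
    reference_steps=["count the red blocks", "subtract the blue blocks"],
    predicted_steps=["count the red blocks", "guess"],
    reference_answer="3",
    predicted_answer="3",
)
print(final_answer_accuracy([example]))  # 1.0
print(step_score(example))               # 0.5
```

In this example the final answer is correct, so answer-only scoring reports 1.0, while the step-level score of 0.5 shows that half of the reference reasoning was missing.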

VRC-Bench.torrent

Seeding: 1, Downloading: 0, Completed: 99, Total Downloads: 212

VRC-Bench/
- README.md
  1.79 KB
- README.txt
  3.58 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

VRC-Bench Visual Reasoning Benchmark Dataset
