FDAbench-Full Heterogeneous Data Analysis Benchmark Dataset

Date

19 days ago

Organization

Nanyang Technological University
National University of Singapore

Paper URL

2509.02473

License

CC BY 4.0

FDAbench-Full is the first benchmark of heterogeneous data analysis tasks for data agents, released in 2025 by Nanyang Technological University, National University of Singapore, and Huawei Technologies Co., Ltd. The accompanying paper is "FDABench: A Benchmark for Data Agents on Analytical Queries over Heterogeneous Data". The benchmark aims to evaluate a model's capabilities in database query generation, SQL understanding, and financial data analysis.

The dataset contains 2,007 high-quality analysis tasks spanning a range of data domains, difficulty levels, and task categories. Each example carries the following metadata fields: task_id (unique task identifier), instance_id (instance identifier), db (database name or identifier), level (difficulty level: easy, medium, or hard), database_type (database system type), question_type (question category), tools_available (list of available tools), and query (main question or query text).
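To make the field list above concrete, here is a minimal Python sketch of one record. The field names come from the description; the Python types, the `FDATask` class name, and every value in the example record are assumptions made for illustration and are not taken from the dataset itself.

```python
from dataclasses import dataclass, field
from typing import List

# Difficulty levels named in the dataset description.
VALID_LEVELS = {"easy", "medium", "hard"}


@dataclass
class FDATask:
    """Hypothetical container for one FDAbench-Full example (types are assumed)."""
    task_id: str            # unique task identifier
    instance_id: str        # instance identifier
    db: str                 # database name or identifier
    level: str              # difficulty level: easy, medium, or hard
    database_type: str      # database system type
    question_type: str      # question category
    tools_available: List[str] = field(default_factory=list)  # available tools
    query: str = ""         # main question or query text

    def __post_init__(self) -> None:
        # Reject difficulty levels outside the three documented values.
        if self.level not in VALID_LEVELS:
            raise ValueError(f"unknown level: {self.level!r}")


# All values below are placeholders, not real dataset content.
example = FDATask(
    task_id="task-0001",
    instance_id="inst-0001",
    db="example_db",
    level="easy",
    database_type="sqlite",
    question_type="report",
    tools_available=["sql_executor"],
    query="Placeholder question text.",
)
```

Validating `level` at construction time is a design choice of this sketch: it surfaces malformed records early rather than during evaluation.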

Dataset structure

The dataset contains three task types:

  • Single-choice questions: 579 questions, each with exactly one correct answer. They mainly test the model's understanding of database concepts and SQL queries.
  • Multiple-choice questions (Multiple): 760 complex questions that may have more than one correct answer. The options include precise numerical results and reasoning-based conclusions, and the questions evaluate the model's combined data analysis and reasoning capabilities.
  • Report generation (report): 668 questions that require a detailed analysis report, testing the data agent's ability to analyze multiple data sources comprehensively. A reference report is provided for each as the basis for comparative evaluation.
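As a quick consistency check, the three per-type counts listed above add up to the stated total of 2,007 tasks. The short Python sketch below performs that arithmetic and computes each type's share; the dictionary keys are hypothetical labels chosen for readability and are not necessarily the values stored in the dataset's question_type field.

```python
# Task counts as stated in the dataset description.
# Keys are illustrative labels (assumed), not confirmed question_type values.
TASK_COUNTS = {
    "single_choice": 579,
    "multiple_choice": 760,
    "report": 668,
}

total = sum(TASK_COUNTS.values())  # 2007


def shares(counts: dict) -> dict:
    """Return each task type's percentage of the total, rounded to 0.1."""
    overall = sum(counts.values())
    return {name: round(100 * n / overall, 1) for name, n in counts.items()}


percentages = shares(TASK_COUNTS)
```

Multiple-choice questions form the largest group, at a little under 38% of all tasks.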
