HyperAIHyperAI

Command Palette

Search for a command to run...

AceReason-1.1-SFT Mathematical Code Reasoning Dataset

Date

4 months ago

Organization

NVIDIA

Paper URL

arxiv.org

Join the Discord Community

AceReason-1.1-SFT is a diverse and high-quality supervised fine-tuning (SFT) dataset released by NVIDIA in 2025, focusing on mathematical and code reasoning. The related paper results are:AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy", which aims to train SFT models that focus on mathematical and code reasoning.

This dataset serves as a mathematical and code reasoning model AceReason-Nemotron-1.1-7B SFT training data of , all answers in the dataset are generated by DeepSeek-R1.

The AceReason-1.1-SFT dataset contains 2,668,741 math samples and 1,301,591 code samples, covering data from OpenMathReasoning, NuminaMath-CoT, OpenCodeReasoning, MagicoderEvolInstruct, opc-sft-stage2, leetcode, TACO, and apps. The dataset is cleaned and samples with 9-gram overlap with any test samples in math and coding benchmarks are filtered.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
AceReason-1.1-SFT Mathematical Code Reasoning Dataset | Datasets | HyperAI