WebInstruct-verified multi-domain Reasoning Dataset

Date: 3 months ago
Organization: University of Waterloo
Paper URL: arxiv.org
License: Apache 2.0


WebInstruct-verified is a multi-domain reasoning dataset jointly released by the University of Waterloo and the Vector Institute in 2025. The accompanying paper, "General-Reasoner: Advancing LLM Reasoning Across All Domains", aims to enhance LLMs' reasoning ability across diverse fields while retaining their strengths in mathematics.

The dataset contains approximately 230,000 reasoning questions in a balanced mix of answer formats, including multiple-choice questions and numerical expressions. It primarily covers mathematics, physics, chemistry, and finance, along with a range of humanities and social science disciplines.

Dataset characteristics:

  • Zero RL training: Reinforcement learning is applied directly to the base LLM, bypassing the intermediate supervised fine-tuning stage.
  • Diverse reasoning data: Over 230K high-quality questions sourced from the web and filtered across disciplines so that each answer can be verified.
  • Model-based verifier: A compact 1.5B generative verifier model performs context-aware, chain-of-thought answer verification and outperforms traditional rule-based approaches.
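
To illustrate why rule-based answer checking can fall short on mixed answer formats, here is a minimal sketch of two simple rule-based checks. It is not the verifier from the paper; the function names `exact_match` and `numeric_match` and the sample answer pairs are hypothetical, chosen only to show where fixed rules stop working.

```python
from fractions import Fraction


def exact_match(predicted: str, reference: str) -> bool:
    """Naive rule: case-insensitive string equality after trimming whitespace."""
    return predicted.strip().lower() == reference.strip().lower()


def numeric_match(predicted: str, reference: str) -> bool:
    """Slightly smarter rule: compare as exact rationals when both strings parse."""
    try:
        return Fraction(predicted.strip()) == Fraction(reference.strip())
    except (ValueError, ZeroDivisionError):
        # Not a plain number (units, words, symbolic expressions, ...).
        return False


# Hypothetical (predicted, reference) pairs in different answer formats.
pairs = [
    ("B", "b"),                        # multiple-choice label
    ("0.5", "1/2"),                    # same number, different notation
    ("5 m/s", "5 meters per second"),  # same quantity, different wording
]

for predicted, reference in pairs:
    print(
        f"{predicted!r} vs {reference!r}: "
        f"exact={exact_match(predicted, reference)}, "
        f"numeric={numeric_match(predicted, reference)}"
    )
```

Both rules reject the last pair even though the two answers mean the same thing. Cases like this are the kind of gap a context-aware, model-based verifier is meant to close.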

Figure: Dataset field distribution
