Date

5 months ago

Organization

Paper URL

2512.06201

License

CC BY 4.0

Tags

Natural Language Processing

Reasoning

TxT360-3efforts is a large-scale language model training dataset for supervised fine-tuning (SFT), released by Mohamed bin Zayed University of Artificial Intelligence in 2025. The related paper is... K2-V2: A 360-Open, Reasoning-Enhanced LLMThe aim is to control the three inference strengths of the model through chat templates. This dataset comprises approximately 10 million samples and 10 billion training tokens, covering nine task categories: mathematics, coding, general dialogue, STEM reasoning, instruction following, tool invocation, agent trajectory, self-identity modeling, and secure alignment. It includes a large number of multi-turn dialogues and samples with verifiable constraints. The data originates from open-source licensed public datasets or high-quality synthetic data, and has undergone rigorous quality filtering, deduplication, and benchmark decontamination. Answers are primarily generated by GPT-OSS-120B at different inference intensities. The dataset explicitly distinguishes between low, medium, and high inference intensities using a unified chat template, enabling the model to learn during training to adjust generation length and inference depth according to different inference requirements.

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at support@hyper.ai for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Discuss on Discord

Date

5 months ago

Organization

Paper URL

2512.06201

License

CC BY 4.0

Related Datasets

zh-meme-sft-8k Chinese Internet Meme Culture Dataset

3 months ago

CHIMERA General Inference Synthetic Dataset

4 months ago

Open-RL Inference Problem Dataset

4 months ago

Nemotron-Math-v2 Mathematical Inference Dataset

5 months ago

GroundingME Complex Scene Understanding Evaluation Dataset

5 months ago

MCIF Multimodal Cross-Language Instruction Following Dataset

5 months ago

X-ray Contraband Detection Dataset

5 months ago

LongBench-Pro Long Context Comprehensive Evaluation Dataset

6 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

TxT360-3efforts Multi-Task Inference Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

TxT360-3efforts Multi-Task Inference Dataset

Related Datasets

zh-meme-sft-8k Chinese Internet Meme Culture Dataset

CHIMERA General Inference Synthetic Dataset

Open-RL Inference Problem Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

TxT360-3efforts Multi-Task Inference Dataset

Related Datasets

zh-meme-sft-8k Chinese Internet Meme Culture Dataset

CHIMERA General Inference Synthetic Dataset

Open-RL Inference Problem Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

zh-meme-sft-8k Chinese Internet Meme Culture Dataset

CHIMERA General Inference Synthetic Dataset

Open-RL Inference Problem Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset

Related Datasets

zh-meme-sft-8k Chinese Internet Meme Culture Dataset

CHIMERA General Inference Synthetic Dataset

Open-RL Inference Problem Dataset

Nemotron-Math-v2 Mathematical Inference Dataset

GroundingME Complex Scene Understanding Evaluation Dataset

MCIF Multimodal Cross-Language Instruction Following Dataset

X-ray Contraband Detection Dataset

LongBench-Pro Long Context Comprehensive Evaluation Dataset