HyperAIHyperAI

Command Palette

Search for a command to run...

T-Wix Russian SFT Dataset

Date

3 months ago

Size

1.43 GB

Paper URL

arxiv.org

T-Wix is a Russian SFT dataset, and the related paper is "From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning", which aims to enhance the model's capabilities from solving algorithmic and mathematical problems to dialogue, logical thinking and reasoning patterns.

The dataset contains 499,598 Russian language samples, including 468,614 general samples covering a variety of areas, including mathematics, science, programming, general knowledge, instruction following, role-playing, etc. The reasoning samples contain 30,984 data points, focusing on advanced mathematics and science problems and providing detailed reasoning traces.

T-Wix.torrent
Seeding 1Downloading 0Completed 31Total Downloads 89
  • T-Wix/
    • README.md
      1.28 KB
    • README.txt
      2.57 KB
      • data/
        • T-Wix.zip
          1.43 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
T-Wix Russian SFT Dataset | Datasets | HyperAI