LongAlign 10K Large Model Long Context Alignment Dataset

Date: 2 years ago
Size: 392.42 MB
Organization: Tsinghua University

LongAlign-10k is a dataset proposed by Tsinghua University to address the challenges large models face in long-context alignment. It contains 10,000 long instruction-following samples, each between 8k and 64k in length.

To build the dataset, the authors first collected source material from 9 different domains, including books, encyclopedias, academic papers, and code, and then used the Claude 2.1 large model to generate diverse long-context tasks and answers. The dataset is designed to evaluate how well large models perform in long contexts and how well they follow task instructions 10k-100k in length.

LongAlign.torrent
  • LongAlign/
    • README.md (1.28 KB)
    • README.txt (2.57 KB)
    • data/
      • LongAlign-10k.zip (392.42 MB)
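
Once the download finishes, it can help to look inside the archive before relying on any particular file layout. The following is a minimal Python sketch using only the standard library. The path LongAlign/data/LongAlign-10k.zip is an assumption taken from the file tree above, the helper names are hypothetical, and nothing is assumed about the names or format of the files inside the zip.

```python
import zipfile
from pathlib import Path


def list_archive_members(zip_path):
    """Return (name, size_in_bytes) pairs for every file in a zip archive."""
    with zipfile.ZipFile(zip_path) as archive:
        return [
            (info.filename, info.file_size)
            for info in archive.infolist()
            if not info.is_dir()
        ]


def extract_archive(zip_path, target_dir):
    """Extract the archive into target_dir and return the extracted file paths."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    return sorted(p for p in target.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Assumed location, following the file tree above; adjust to where you saved it.
    archive_path = Path("LongAlign/data/LongAlign-10k.zip")
    if archive_path.exists():
        for name, size in list_archive_members(archive_path):
            print(f"{name}\t{size / 1e6:.2f} MB")
    else:
        print(f"{archive_path} not found; download it first.")
```

Listing the members first shows which data files are actually present, so any loading code can be written against the real file names rather than guessed ones.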
