HyperAIHyperAI

Command Palette

Search for a command to run...

Pinocchio Pinocchio Factual Knowledge Evaluation Dataset

Date

2 years ago

Size

3.09 MB

Organization

Tsinghua University

Publish URL

github.com

Featured Image

The Pinocchio dataset was jointly created by researchers from Tsinghua University, University of Illinois at Chicago, and University of Cambridge. Its purpose is to comprehensively evaluate the performance of large language models (LLMs) in factual knowledge storage and reasoning capabilities.

This dataset covers 20,000 diverse factual questions covering different sources, timelines, domains, regions, and languages.The dataset contains 7 different tasks to test LLMs’ ability to reason over multiple facts, handle structured and unstructured knowledge, identify subtle factual differences, and resist adversarial examples. Pinocchio provides researchers with a powerful tool to understand the capabilities of models at multiple levels while pushing the boundaries of LLMs’ ability to advance factual knowledge.

Pinocchio.torrent
Seeding 1Downloading 0Completed 112Total Downloads 148
  • Pinocchio/
    • README.md
      1.46 KB
    • README.txt
      2.92 KB
      • data/
        • Pinocchio-main.zip
          3.09 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Pinocchio Pinocchio Factual Knowledge Evaluation Dataset | Datasets | HyperAI