HyperAIHyperAI

Command Palette

Search for a command to run...

RLAIF-V-Dataset Large-scale Multimodal Preference Dataset

Date

a year ago

Size

11.77 GB

Organization

OpenBMB

Paper URL

arxiv.org

The RLAIF-V dataset is an AI-generated multimodal preference dataset that covers a variety of tasks and domains. The dataset contains more than 44,757 high-quality comparison pairs for training and evaluating multimodal large language models (MLLMs). The RLAIF-V dataset uses a novel approach to use open source large models to de-confound model responses and provide high-quality feedback data to reduce the hallucination phenomenon of different MLLMs.

In addition, the RLAIF-V dataset was used to train the MiniCPM-Llama3-V 2.5 model, which represents the first end-side GPT-4V-level MLLM17. The RLAIF-V project has open-sourced the code, weights (7B, 12B), and data for use and further research by the research community.

The main features of the RLAIF-V dataset include:

  1. High-quality feedback data: Effective reduction of hallucinations by different MLLMs used in the dataset.
  2. Open Source: The dataset is completely open source, allowing researchers to access and use it freely.
  3. Multi-task and multi-domain: The dataset covers a wide range of tasks and domains, providing diverse preference data.

The license of the RLAIF-V dataset is CC BY NC 4.0, which allows non-commercial use only, and models trained using this dataset should not be used outside of research purposes.

RLAIF-V-Dataset.torrent
Seeding 1Downloading 0Completed 161Total Downloads 206
  • RLAIF-V-Dataset/
    • README.md
      1.86 KB
    • README.txt
      3.72 KB
      • data/
        • RLAIF-V-Dataset.zip
          11.77 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
RLAIF-V-Dataset Large-scale Multimodal Preference Dataset | Datasets | HyperAI