HyperAIHyperAI

Command Palette

Search for a command to run...

ShareGPT4V Large-scale high-quality Image and Text Dataset

Date

a year ago

Size

466.32 MB

Organization

University of Science and Technology of China
Shanghai Artificial Intelligence Laboratory

Publish URL

github.com

Paper URL

arxiv.org

License

CC BY-SA 4.0

Featured Image

The ShareGPT4V dataset is a high-quality dataset consisting of a large number of image-text pairs, which is used to train visual-language models (VLMs) to improve the model's capabilities in image understanding and text generation. The dataset contains 1.2 million image-text pairs that effectively align visual and language features, enhance the model's ability to follow instructions, and incorporate more academic tasks such as ScienceQA, TextVQA, SBU, etc. By introducing this dataset, the model has been significantly improved in image-text alignment capabilities, which is a key aspect for multimodal representation learning.

This dataset was released by the University of Science and Technology of China, Shanghai Artificial Intelligence Laboratory in 2023.

ShareGPT4V.torrent
Seeding 1Downloading 0Completed 151Total Downloads 245
  • ShareGPT4V/
    • README.md
      1.51 KB
    • README.txt
      3.03 KB
      • data/
        • ShareGPT4V.zip
          466.32 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
ShareGPT4V Large-scale high-quality Image and Text Dataset | Datasets | HyperAI