HyperAIHyperAI

Command Palette

Search for a command to run...

LAION-SG Large-scale high-quality Image Understanding Dataset

Date

a year ago

Size

158.26 MB

Organization

Alibaba Group
浙江大学
Peking University

Publish URL

github.com

Paper URL

arxiv.org

LAION-SG is a large-scale, high-quality image understanding dataset built by Zhejiang University, Jiangnan University, Peking University, Alibaba Group, and Ant Group in 2024.LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations". LAION-SG contains 540,005 scene graph-image pairs with object, attribute and relationship annotations, which are divided into training, validation and test sets. The images in the dataset are from the LAION-Aesthetics V2 (6.5+) dataset, and the annotation process uses GPT-4o for automatic annotation. Compared to the original LAION-Aesthetics dataset, LAION-SG has improved both the average annotation length and accuracy. Each sample in this dataset contains an average of 6.39 objects, and the object information has increased by 20%. If abstract proper nouns are excluded, this advantage increases to 216%. The LAION-SG dataset is suitable for a variety of cross-modal research fields of images and text, including image description generation, visual question answering systems, and image retrieval tasks, all of which rely on a deep understanding and semantic parsing of image content.

LAION-SG.torrent
Seeding 1Downloading 0Completed 191Total Downloads 356
  • LAION-SG/
    • README.md
      1.85 KB
    • README.txt
      3.69 KB
      • data/
        • LAION-SG.zip
          158.26 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp