HyperAIHyperAI

Command Palette

Search for a command to run...

LAION-SG Large-scale high-quality Image Understanding Dataset

Date

a year ago

Size

158.26 MB

Organization

Peking University
Zhejiang University

Publish URL

github.com

Paper URL

arxiv.org

LAION-SG is a large-scale, high-quality image understanding dataset built by Zhejiang University, Jiangnan University, Peking University, Alibaba Group, and Ant Group in 2024.LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations". LAION-SG contains 540,005 scene graph-image pairs with object, attribute and relationship annotations, which are divided into training, validation and test sets. The images in the dataset are from the LAION-Aesthetics V2 (6.5+) dataset, and the annotation process uses GPT-4o for automatic annotation.

Compared to the original LAION-Aesthetics dataset, LAION-SG has improved both the average annotation length and accuracy. Each sample in this dataset contains an average of 6.39 objects, and the object information has increased by 20%. If abstract proper nouns are excluded, this advantage increases to 216%.

The LAION-SG dataset is suitable for a variety of cross-modal research fields of images and text, including image description generation, visual question answering systems, and image retrieval tasks, all of which rely on a deep understanding and semantic parsing of image content.

    LAION-SG.torrent
    Seeding 1Downloading 0Completed 145Total Downloads 269
    • LAION-SG/
      • README.md
        1.85 KB
      • README.txt
        3.69 KB
        • data/
          • LAION-SG.zip
            158.26 MB

    Build AI with AI

    From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

    AI Co-coding
    Ready-to-use GPUs
    Best Pricing
    Get Started

    Hyper Newsletters

    Subscribe to our latest updates
    We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
    Powered by MailChimp
    LAION-SG Large-scale high-quality Image Understanding Dataset | Datasets | HyperAI