Visual7W Visual Question Answering Dataset

Date: 3 years ago
Size: 1.76 GB
Organization: Stanford University
Publish URL: ai.stanford.edu
Paper URL: arxiv.org
License: Other

Visual7W is a visual question answering (VQA) dataset for image understanding. It links textual descriptions to the image regions they refer to, so alongside the images themselves it provides questions and answers grounded in specific regions of each image.

Visual7W is a subset of the Visual Genome dataset. It contains 47,300 COCO images, 327,939 question-answer pairs, 1,311,756 human-generated multiple-choice candidate answers (four per question), and 561,459 object groundings covering 36,579 categories.

Visual7W's questions fall into seven types, the seven Ws that give the dataset its name: What, Where, How, When, Who, Why, and Which. All questions are multiple-choice, and each has four candidate answers.
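The multiple-choice format described above can be illustrated with a minimal Python sketch. The record layout and field names (`question`, `type`, `answer`, `distractors`) are assumptions made for illustration only; they are not taken from this page, so check the README shipped with the dataset for the actual schema. The sample records are invented, not drawn from the dataset.

```python
import random
from collections import Counter

# Hypothetical records: field names and contents are illustrative assumptions,
# not the dataset's confirmed schema.
sample_records = [
    {"question": "What is the man holding?", "type": "what",
     "answer": "A frisbee.",
     "distractors": ["A kite.", "A ball.", "A hat."]},
    {"question": "Where is the cat sitting?", "type": "where",
     "answer": "On the sofa.",
     "distractors": ["On the floor.", "In a box.", "On the table."]},
]


def build_choices(record, rng):
    """Combine the correct answer with three distractors and shuffle them.

    Returns the four candidates and the index of the correct one.
    """
    candidates = [record["answer"]] + list(record["distractors"])
    if len(candidates) != 4:
        raise ValueError("expected exactly four candidate answers")
    rng.shuffle(candidates)
    return candidates, candidates.index(record["answer"])


def count_by_type(records):
    """Count questions per question type (what, where, how, ...)."""
    return Counter(record["type"] for record in records)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the shuffle is reproducible
    for record in sample_records:
        candidates, correct = build_choices(record, rng)
        print(record["question"])
        for i, candidate in enumerate(candidates):
            marker = "*" if i == correct else " "
            print(f"  {marker} {candidate}")
    print(count_by_type(sample_records))
```

Shuffling with a seeded `random.Random` keeps the position of the correct answer from being predictable while still making evaluation runs repeatable.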

Visual7W.torrent
Seeding: 2 | Downloading: 0 | Completed: 550 | Total Downloads: 673
  • Visual7W/
    • README.md (1.34 KB)
    • README.txt (2.68 KB)
    • data/
      • dataset_v7w_grounding_annotations.zip (7.07 MB)
      • dataset_v7w_pointing.zip (18.56 MB)
      • dataset_v7w_telling.zip (24.2 MB)
      • visual7w-toolkit (24.39 MB)
      • visual7w_images.zip (1.76 GB)
