HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

ArtQuest: Countering Hidden Language Biases in ArtVQA

{Gerard de Melo Sedigheh Eslami Tibor Bleidt}

ArtQuest: Countering Hidden Language Biases in ArtVQA

Abstract

The task of Visual Question Answering (VQA) has been studied extensively on general-domain real-world images. Transferring insights from general domain VQA to the art domain (ArtVQA) is non-trivial, as the latter requires models to identify abstract concepts, details of brushstrokes and styles of paintings in the visual data as well as possess background knowledge about art. This is exacerbated by the lack of high-quality datasets. In this work, we shed light on hidden linguistic biases in the AQUA dataset, which is the only publicly available benchmark dataset for ArtVQA. As a result, the majority of questions can be answered without consulting the visual information, making the “V” in ArtVQA rather insignificant. In order to counter this problem, we create a simple, yet practical dataset, ArtQuest, using structured information from the SemArt collection. Our dataset and the pipeline to reproduce our results are publicly available at https://github.com/bletib/artquest.

Benchmarks

BenchmarkMethodologyMetrics
visual-question-answering-vqa-on-artquestPrefixLM with CLIP and T5
1:1 Accuracy: 50.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
ArtQuest: Countering Hidden Language Biases in ArtVQA | Papers | HyperAI