3 months ago

GraphVQA: Language-Guided Graph Neural Networks for Graph-based Visual Question Answering

Weixin Liang, Yanhao Jiang, Zixuan Liu

Abstract

Images are more than a collection of objects or attributes: they represent a web of relationships among interconnected objects. The scene graph has emerged as a new modality for a structured graphical representation of images. A scene graph encodes objects as nodes and the pairwise relations between them as edges. To support question answering on scene graphs, we propose GraphVQA, a language-guided graph neural network framework that translates a natural language question into multiple iterations of message passing among graph nodes and executes them. We explore the design space of the GraphVQA framework and discuss the trade-offs of different design choices. Our experiments on the GQA dataset show that GraphVQA outperforms the state-of-the-art model by a large margin (94.78% vs. 88.43% for the prior state of the art).
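The idea of running a question as rounds of message passing over a scene graph can be illustrated with a small sketch. This is not the authors' implementation: the sigmoid gating rule, the residual update, the toy two-dimensional features, and every name below (`message_passing_step`, `run_rounds`, `instructions`) are assumptions made for illustration only. It assumes the question has already been turned into one instruction vector per round.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def message_passing_step(node_feats, edges, instruction):
    """One hypothetical language-guided round of message passing.

    node_feats: dict mapping node name -> feature vector (list of floats)
    edges: list of (src, dst, relation_vector) triples
    instruction: vector derived from the question for this round (assumed given)
    """
    dim = len(next(iter(node_feats.values())))
    incoming = {name: [0.0] * dim for name in node_feats}
    for src, dst, rel_vec in edges:
        # Edges whose relation matches the instruction pass more information.
        gate = sigmoid(dot(instruction, rel_vec))
        for i in range(dim):
            incoming[dst][i] += gate * node_feats[src][i]
    # Residual update: keep each node's own state and add gated messages.
    return {
        name: [h + m for h, m in zip(node_feats[name], incoming[name])]
        for name in node_feats
    }


def run_rounds(node_feats, edges, instructions):
    """Apply one message-passing round per instruction vector."""
    for instruction in instructions:
        node_feats = message_passing_step(node_feats, edges, instruction)
    return node_feats


# Toy scene graph: a cup on a table.
nodes = {"cup": [1.0, 0.0], "table": [0.0, 1.0]}
edges = [
    ("cup", "table", [5.0, 0.0]),   # relation "on" (toy embedding)
    ("table", "cup", [0.0, 5.0]),   # relation "under" (toy embedding)
]
instructions = [[1.0, 0.0]]  # a single round that attends to "on"

result = run_rounds(nodes, edges, instructions)
```

In this toy setup the "on" edge is gated almost fully open while the "under" edge stays at the neutral gate value of 0.5, so the table node absorbs more of the cup's features than the cup absorbs of the table's. A real model would learn the instruction vectors, relation embeddings, and update function rather than fixing them by hand.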

Code Repositories

codexxxl/GraphVQA (official, PyTorch; mentioned in GitHub)

Benchmarks

Benchmark: graph-question-answering-on-gqa
Methodology: GraphVQA
Metrics: Accuracy: 96.30
