HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Language-Conditioned Graph Networks for Relational Reasoning

Ronghang Hu; Anna Rohrbach; Trevor Darrell; Kate Saenko

Language-Conditioned Graph Networks for Relational Reasoning

Abstract

Solving grounded language tasks often requires reasoning about relationships between objects in the context of a given task. For example, to answer the question "What color is the mug on the plate?" we must check the color of the specific mug that satisfies the "on" relationship with respect to the plate. Recent work has proposed various methods capable of complex relational reasoning. However, most of their power is in the inference structure, while the scene is represented with simple local appearance features. In this paper, we take an alternate approach and build contextualized representations for objects in a visual scene to support relational reasoning. We propose a general framework of Language-Conditioned Graph Networks (LCGN), where each node represents an object, and is described by a context-aware representation from related objects through iterative message passing conditioned on the textual input. E.g., conditioning on the "on" relationship to the plate, the object "mug" gathers messages from the object "plate" to update its representation to "mug on the plate", which can be easily consumed by a simple classifier for answer prediction. We experimentally show that our LCGN approach effectively supports relational reasoning and improves performance across several tasks and datasets. Our code is available at http://ronghanghu.com/lcgn.

Code Repositories

ronghanghu/lcgn
tf
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
visual-question-answering-on-clevrsingle-hop + LCGN (ours)
Accuracy: 97.9
visual-question-answering-on-gqa-test-devsingle-hop + LCGN (ours)
Accuracy: 55.8
visual-question-answering-on-gqa-test-stdsingle-hop + LCGN (ours)
Accuracy: 56.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Language-Conditioned Graph Networks for Relational Reasoning | Papers | HyperAI