HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Neural Motifs: Scene Graph Parsing with Global Context

Rowan Zellers; Mark Yatskar; Sam Thomson; Yejin Choi

Neural Motifs: Scene Graph Parsing with Global Context

Abstract

We investigate the problem of producing structured graph representations of visual scenes. Our work analyzes the role of motifs: regularly appearing substructures in scene graphs. We present new quantitative insights on such repeated structures in the Visual Genome dataset. Our analysis shows that object labels are highly predictive of relation labels but not vice-versa. We also find that there are recurring patterns even in larger subgraphs: more than 50% of graphs contain motifs involving at least two relations. Our analysis motivates a new baseline: given object detections, predict the most frequent relation between object pairs with the given labels, as seen in the training set. This baseline improves on the previous state-of-the-art by an average of 3.6% relative improvement across evaluation settings. We then introduce Stacked Motif Networks, a new architecture designed to capture higher order motifs in scene graphs that further improves over our strong baseline by an average 7.1% relative gain. Our code is available at github.com/rowanz/neural-motifs.

Code Repositories

HCPLab-SYSU/KERN
pytorch
Mentioned in GitHub
shikorab/SceneGraph
tf
Mentioned in GitHub
bknyaz/sgg
pytorch
Mentioned in GitHub
rowanz/neural-motifs
Official
pytorch
Mentioned in GitHub
yuweihao/KERN
pytorch
Mentioned in GitHub
microsoft/scene_graph_benchmark
pytorch
Mentioned in GitHub
zhangce01/HiKER-SGG
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
panoptic-scene-graph-generation-on-psgMOTIFS
R@20: 20.0
mR@20: 9.10

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Neural Motifs: Scene Graph Parsing with Global Context | Papers | HyperAI