Neural Motifs: Scene Graph Parsing with Global Context

11/17/2017
by   Rowan Zellers, et al.
0

We investigate the problem of producing structured graph representations of visual scenes. Our work analyzes the role of motifs: regularly appearing substructures in scene graphs. We present new quantitative insights on such repeated structures in the Visual Genome dataset. Our analysis shows that object labels are highly predictive of relation labels but not vice-versa. We also find there are recurring patterns even in larger subgraphs: more than 50 of graphs contain motifs involving at least two relations. This analysis leads to a new baseline that is simple, yet strikingly powerful. While hardly considering the overall visual context of an image, it outperforms previous approaches. We then introduce Stacked Motif Networks, a new architecture for encoding global context that is crucial for capturing higher order motifs in scene graphs. Our best model for scene graph detection achieves a 7.3 improvement in recall@50 (41

READ FULL TEXT

page 1

page 5

page 8

research
09/13/2019

Scene Graph Parsing by Attention Graph

Scene graph representations, which form a graph of visual object nodes t...
research
07/12/2021

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scen...
research
08/12/2020

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation

Scene graph generation aims to produce structured representations for im...
research
12/16/2019

Learning Canonical Representations for Scene Graph to Image Generation

Generating realistic images of complex visual scenes becomes very challe...
research
06/16/2021

Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions

In this work, we seek new insights into the underlying challenges of the...
research
03/20/2023

Location-Free Scene Graph Generation

Scene Graph Generation (SGG) is a challenging visual understanding task....
research
02/01/2020

Unbiased Scene Graph Generation via Rich and Fair Semantic Extraction

Extracting graph representation of visual scenes in image is a challengi...

Please sign up or login with your details

Forgot password? Click here to reset