Hierarchical Graph Attention Network for Visual Relationship Detection - Citegraph

Paper Info

Title
Hierarchical Graph Attention Network for Visual Relationship Detection

Abstract
Visual Relationship Detection (VRD) aims to describe the relationship between two objects by providing a structural triplet shown as . Existing graph-based methods mainly represent the relationships by an object-level graph, which ignores to model the triplet-level dependencies. In this work, a Hierarchical Graph Attention Network (HGAT) is proposed to capture the dependencies on both object-level and triplet-level. Object-level graph aims to capture the interactions between objects, while the triplet-level graph models the dependencies among relation triplets. In addition, prior knowledge and attention mechanism are introduced to fix the redundant or missing edges on graphs that are constructed according to spatial correlation. With these approaches, nodes are allowed to attend over their spatial and semantic neighborhoods\u0027 features based on the visual or semantic feature correlation. Experimental results on the well-known VG and VRD datasets demonstrate that our model significantly outperforms the state-of-the-art methods.

Year	DOI	Venue
2020	10.1109/CVPR42600.2020.01390	CVPR
DocType	Citations	PageRank
Conference	2	0.36
References	Authors
26	2

Authors (2 rows)

Cited by (2 rows)

References (26 rows)

Name	Order	Citations	PageRank
Li Mi	1	2	1.03
Zhenzhong Chen	2	1244	101.41

1