Title
Memory-Based Network for Scene Graph with Unbalanced Relations
Abstract
The scene graph which can be represented by a set of visual triples is composed of objects and the relations between object pairs. It is vital for image captioning, visual question answering, and many other applications. However, there is a long tail distribution on the scene graph dataset, and the tail relation cannot be accurately identified due to the lack of training samples. The problem of the nonstandard label and feature overlap on the scene graph affects the extraction of discriminative features and exacerbates the effect of data imbalance on the model. For these reasons, we propose a novel scene graph generation model that can effectively improve the detection of low-frequency relations. We use the method of memory features to realize the transfer of high-frequency relation features to low-frequency relation features. Extensive experiments on scene graph datasets show that our model significantly improved the performance of two evaluation metrics [email protected] and [email protected] compared with state-of-the-art baselines.
Year
DOI
Venue
2020
10.1145/3394171.3413507
MM '20: The 28th ACM International Conference on Multimedia Seattle WA USA October, 2020
DocType
ISBN
Citations 
Conference
978-1-4503-7988-5
0
PageRank 
References 
Authors
0.34
20
6
Name
Order
Citations
PageRank
Weitao Wang166.26
Ruyang Liu200.34
Meng Wang32411.05
Sen Wang447737.24
Xiaojun Chang5158576.85
Chen Yang617243.55