Vsr Plus Plus : Improving Visual Semantic Reasoning For Fine-Grained Image-Text Matching - Citegraph

Paper Info

Title
Vsr Plus Plus : Improving Visual Semantic Reasoning For Fine-Grained Image-Text Matching

Abstract
Image-text matching has made great progresses recently, but there still remains challenges in fine-grained matching. To deal with this problem, we propose an Improved Visual Semantic Reasoning model (VSR++), which jointly models 1) global alignment between images and texts and 2) local correspondence between regions and words in a unified framework. To exploit their complementary advantages, we also develop a suitable learning strategy to balance their relative importance. As a result, our model can distinguish image regions and text words in a fine-grained level, and thus achieves the current state-of-the-art performance on two benchmark datasets.

Year	DOI	Venue
2020	10.1109/ICPR48806.2021.9413223	2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR)
DocType	ISSN	Citations
Conference	1051-4651	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Hui Yuan	1	0	0.34
Yan Huang	2	226	27.65
Dongbo Zhang	3	143	19.22
Zerui Chen	4	0	2.37
Wenlong Cheng	5	0	0.34
Liang Wang	6	128	12.87

1