Title
Dual self-attention with co-attention networks for visual question answering
Abstract
•A novel model based on the self-attention mechanism is proposed to learn more effective multi-modal representations.•The DSACA model is proposed to capture the internal dependencies and cross-modal correlation between the image and question sentence.•Extensive experiments and analysis confirm the superiority of the proposed DSACA.
Year
DOI
Venue
2021
10.1016/j.patcog.2021.107956
Pattern Recognition
Keywords
DocType
Volume
Self-attention,Visual-textual co-attention,Visual question answering
Journal
117
Issue
ISSN
Citations 
1
0031-3203
2
PageRank 
References 
Authors
0.37
0
7
Name
Order
Citations
PageRank
Liu Yun128057.13
Xiaoming Zhang2282.87
Qianyun Zhang320.37
Chaozhuo Li4478.45
Feiran Huang5508.30
Xianghong Tang620.37
Zhoujun Li7964115.99