Title
Global-Attention-Based Neural Networks for Vision Language Intelligence
Abstract
In this paper, we develop a novel global-attention-based neural network (GANN) for vision language intelligence, specifically, image captioning (language description of a given image). As many previous works, the encoder-decoder framework is adopted in our proposed model, in which the encoder is responsible for encoding the region proposal features and extracting global caption feature based on a ...
Year
DOI
Venue
2021
10.1109/JAS.2020.1003402
IEEE/CAA Journal of Automatica Sinica
Keywords
DocType
Volume
Feature extraction,Proposals,Decoding,Visualization,Neural networks,Semantics,Task analysis
Journal
8
Issue
ISSN
Citations 
7
2329-9266
2
PageRank 
References 
Authors
0.34
0
4
Name
Order
Citations
PageRank
Pei Liu1121.50
Yingjie Zhou241.73
Dezhong Peng328527.92
Dapeng Wu44463325.77