Abstract | ||
---|---|---|
In this paper, we develop a novel global-attention-based neural network (GANN) for vision language intelligence, specifically, image captioning (language description of a given image). As many previous works, the encoder-decoder framework is adopted in our proposed model, in which the encoder is responsible for encoding the region proposal features and extracting global caption feature based on a ... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/JAS.2020.1003402 | IEEE/CAA Journal of Automatica Sinica |
Keywords | DocType | Volume |
Feature extraction,Proposals,Decoding,Visualization,Neural networks,Semantics,Task analysis | Journal | 8 |
Issue | ISSN | Citations |
7 | 2329-9266 | 2 |
PageRank | References | Authors |
0.34 | 0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Pei Liu | 1 | 12 | 1.50 |
Yingjie Zhou | 2 | 4 | 1.73 |
Dezhong Peng | 3 | 285 | 27.92 |
Dapeng Wu | 4 | 4463 | 325.77 |