Abstract | ||
---|---|---|
•We combine visual attention and textual attention to forma dual attention mechanism to guide the image caption generation.•We adopt FCN to predict image tagsand fuse tag generation and image caption generation to train encode-decode model.•Our proposed model achieves state-of-the-artperformance in AIC-ICC image Chinese caption dataset. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1016/j.ipm.2019.102178 | Information Processing & Management |
Keywords | DocType | Volume |
Image caption generation,Textual attention,Visual attention,Dual attention,Fully convolutional network | Journal | 57 |
Issue | ISSN | Citations |
2 | 0306-4573 | 4 |
PageRank | References | Authors |
0.45 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maofu Liu | 1 | 4 | 0.79 |
Lingjun Li | 2 | 169 | 17.48 |
Huijun Hu | 3 | 32 | 4.22 |
Weili Guan | 4 | 43 | 10.84 |
Jing Tian | 5 | 367 | 30.59 |