Title
Boosting Memory with a Persistent Memory Mechanism for Remote Sensing Image Captioning.
Abstract
The encoder-decoder framework has been widely used in the remote sensing image captioning task. When we need to extract remote sensing images containing specific characteristics from the described sentences for research, rich sentences can improve the final extraction results. However, the Long Short-Term Memory (LSTM) network used in decoders still loses some information in the picture over time when the generated caption is long. In this paper, we present a new model component named the Persistent Memory Mechanism (PMM), which can expand the information storage capacity of LSTM with an external memory. The external memory is a memory matrix with a predetermined size. It can store all the hidden layer vectors of LSTM before the current time step. Thus, our method can effectively solve the above problem. At each time step, the PMM searches previous information related to the input information at the current time from the external memory. Then the PMM will process the captured long-term information and predict the next word with the current information. In addition, it updates its memory with the input information. This method can pick up the long-term information missed from the LSTM but useful to the caption generation. By applying this method to image captioning, our CIDEr scores on datasets UCM-Captions, Sydney-Captions, and RSICD increased by 3%, 5%, and 7%, respectively.
Year
DOI
Venue
2020
10.3390/rs12111874
REMOTE SENSING
Keywords
DocType
Volume
image caption,remote sensing,long short-term memory,persistent memory mechanism
Journal
12
Issue
Citations 
PageRank 
11
0
0.34
References 
Authors
0
5
Name
Order
Citations
PageRank
Kun Fu141457.81
Yang Li200.34
Wenkai Zhang304.73
Hongfeng Yu403.72
Xian Sun532540.42