Title
A Novel SVM-Based Decoder for Remote Sensing Image Captioning
Abstract
Most of the remote sensing image captioning (IC) models are based on encoder-decoder frameworks where a convolutional neural network (CNN) encodes the image information and a recurrent neural network (RNN) decodes the image information into a sentence description. In order to achieve good accuracies, encoder-decoder frameworks relying on RNNs typically require a huge amount of annotated samples. Furthermore, they demand high and expensive computational power in order to have reasonable training and testing time. In this article, we aim to address these issues by introducing a novel decoder that is based on support vector machines (SVMs). In particular, instead of RNNs, we propose a novel network of SVMs to decode the image information into a sentence description. The proposed IC system is particularly interesting when just a limited amount of training samples is available. Experiments conducted on four different IC datasets confirm the promising capability of the proposed IC system to generate descriptions that are highly correlated with the image content. The proposed IC system is characterized by short training and inference times compared to other state-of-the-art models.
Year
DOI
Venue
2022
10.1109/TGRS.2021.3105004
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
Keywords
DocType
Volume
Integrated circuits, Support vector machines, Decoding, Training, Integrated circuit modeling, Feature extraction, Recurrent neural networks, Convolutional neural networks (CNNs), image captioning (IC), recurrent neural networks (RNNs), support vector machines (SVMs), unmanned aerial vehicles (UAVs)
Journal
60
ISSN
Citations 
PageRank 
0196-2892
0
0.34
References 
Authors
0
2
Name
Order
Citations
PageRank
Genc Hoxha101.01
Farid Melgani2110080.98