Title
Deep Interactive Region Segmentation and Captioning
Abstract
Based on recent developments in dense image captioning, it is now possible to describe every object of a photographed scene with a caption while objects are determined by bounding boxes. However, the user interpretation of such an output is not trivial due to the existence of many overlapping bounding boxes. Furthermore, in current captioning frameworks, the user is not able to involve personal preferences to exclude areas that are out of interest. In this paper, we propose a novel hybrid deep learning architecture for interactive region segmentation and captioning whereby the user is able to specify an arbitrary region of the image that should be highlighted and described. To this end, we trained three different highly deep architectures on our special training data to identify the User Intention Region (UIR). In parallel, a dense image captioning model is utilized to locate all the objects of the scene by drawing bounding boxes and produce their linguistic descriptions. During our fusion approach, the detected UIR will be explained with the caption of the best match bounding box. To the best of our knowledge, this is the first work that provides such a comprehensive output. Our experiments show the superiority of the proposed approach over state-of-the-art interactive segmentation methods on several well-known segmentation benchmarks. In addition, replacement of the bounding boxes with the result of the interactive segmentation leads to a better understanding of the dense image captioning output as well as an enhancement in object localization accuracy.
Year
DOI
Venue
2017
10.1109/SITIS.2017.27
2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)
Keywords
DocType
Volume
nteractive segmentation,deep learning,image captioning,deep hybrid architecture,FCDenseNet
Conference
abs/1707.08364
ISBN
Citations 
PageRank 
978-1-5386-4284-9
1
0.36
References 
Authors
46
3
Name
Order
Citations
PageRank
Ali Sharifi Boroujerdi1122.25
Maryam Khanian240.76
Michael Breuß316825.45