Title
Tagging Personal Photos with Transfer Deep Learning
Abstract
The advent of mobile devices and media cloud services has led to the unprecedented growing of personal photo collections. One of the fundamental problems in managing the increasing number of photos is automatic image tagging. Existing research has predominantly focused on tagging general Web images with a well-labelled image database, e.g., ImageNet. However, they can only achieve limited success on personal photos due to the domain gaps between personal photos and Web images. These gaps originate from the differences in semantic distribution and visual appearance. To deal with these challenges, in this paper, we present a novel transfer deep learning approach to tag personal photos. Specifically, to solve the semantic distribution gap, we have designed an ontology consisting of a hierarchical vocabulary tailored for personal photos. This ontology is mined from $10,000$ active users in Flickr with 20 million photos and 2.7 million unique tags. To deal with the visual appearance gap, we discover the intermediate image representations and ontology priors by deep learning with bottom-up and top-down transfers across two domains, where Web images are the source domain and personal photos are the target. Moreover, we present two modes (single and batch-modes) in tagging and find that the batch-mode is highly effective to tag photo collections. We conducted personal photo tagging on 7,000 real personal photos and personal photo search on the MIT-Adobe FiveK photo dataset. The proposed tagging approach is able to achieve a performance gain of $12.8\\%$ and $4.5\\%$ in terms of NDCG@5, against the state-of-the-art hand-crafted feature-based and deep learning-based methods, respectively.
Year
DOI
Venue
2015
10.1145/2736277.2741112
WWW
Keywords
Field
DocType
Personal photo, image tagging, deep learning, transfer learning, ontology
Ontology,World Wide Web,Information retrieval,Computer science,Transfer of learning,Mobile device,Artificial intelligence,Deep learning,Image database,Vocabulary,Cloud computing,Visual appearance
Conference
Citations 
PageRank 
References 
9
0.50
24
Authors
5
Name
Order
Citations
PageRank
Jianlong Fu119522.47
Tao Mei24702288.54
Kuiyuan Yang343524.67
Hanqing Lu44620291.38
Yong Rui57052449.08