Tag Prediction at Flickr: A View from the Darkroom. - Citegraph

Paper Info

Title
Tag Prediction at Flickr: A View from the Darkroom.

Abstract
Automated photo tagging has established itself as one of the most compelling applications of deep learning. While deep convolutional neural networks have repeatedly demonstrated top performance on standard datasets for classification, there are a number of often overlooked but important considerations when deploying this technology in a real-world scenario. In this paper, we present our efforts in developing a large-scale photo tagging system for Flickr photo search. We discuss topics including how to 1) select the tags that matter most to our users; 2) develop lightweight, high-performance models for tag prediction; and 3) leverage the power of large amounts of noisy data for training. Our results demonstrate that, for real-world datasets, training exclusively with this noisy data yields performance on par with the standard paradigm of first pre-training on clean data and then fine-tuning. In addition, we observe that the models trained with user-generated data can yield better fine-tuning results when a small amount of clean data is available. As such, we advocate for the approach of harnessing user-generated data in large-scale systems.

Year	DOI	Venue
2017	10.1145/3126686.3126745	MM '17: ACM Multimedia Conference Mountain View California USA October, 2017
Keywords	Field	DocType
deep learning, image classification, photo tagging, tag prediction	Noisy data,Information retrieval,Computer science,Convolutional neural network,Artificial intelligence,Deep learning,Machine learning,Tag system,Darkroom	Conference
ISBN	Citations	PageRank
978-1-4503-5416-5	2	0.41
References	Authors
25	5

Authors (5 rows)

Cited by (2 rows)

References (25 rows)

Name	Order	Citations	PageRank
Kofi Boakye	1	155	13.64
Sachin Sudhakar Farfade	2	16	1.80
Hamid Izadinia	3	164	11.16
Yannis Kalantidis	4	862	33.05
Pierre J. Garrigues	5	21	2.54

1