Title
Kickstarting the Commons: The YFCC100M and the YLI Corpora
Abstract
The publication of the Yahoo Flickr Creative Commons 100 Million dataset (YFCC100M), to date the largest open-access collection of photos and videos, has provided a unique opportunity to stimulate new research in multimedia analysis and retrieval. To make the YFCC100M even more valuable, we have started working towards supplementing it with a comprehensive set of precomputed features and high-quality ground truth annotations. As part of our efforts, we are releasing the YLI feature corpus, as well as the YLI-GEO and YLI-MED annotation subsets. Under the Multimedia Commons Project (MMCP), we are currently laying the groundwork for a common platform and framework around the YFCC100M that (i) makes it easier for researchers to contribute additional features and annotations, (ii) supports experimentation on the dataset, and (iii) enables sharing of obtained results. This paper describes the YLI features and annotations released thus far, and sketches our vision for the MMCP.
Year
2015
DOI
10.1145/2814815.2816986
Venue
MM '15: ACM Multimedia Conference, Brisbane, Australia, October 2015
Keywords
Multimedia, datasets, annotations, YFCC100M, YLI
Field
World Wide Web, Annotation, Information retrieval, Computer science, Ground truth, Creative commons, Commons
DocType
Conference
ISBN
978-1-4503-3744-1
Citations
1
PageRank
0.40
References
0
Authors
13
Name                Order   Citations   PageRank
Julia Bernd         1       19          4.98
Damian Borth        2       764         49.45
Carmen J. Carrano   3       2           1.43
Jae-Young Choi      4       783         110.19
Benjamin Elizalde   5       359         22.38
Gerald Friedland    6       1127        96.23
Luke Gottlieb       7       61          5.79
Karl Ni             8       1           0.73
Roger Pearce        9       244         19.40
Douglas Poland      10      242         10.40
Khalid Ashraf       11      8           0.98
David A. Shamma     12      1622        100.50
Bart Thomee         13      773         39.96