Title
QMAS: Querying, Mining and Summarization of Multi-modal Databases
Abstract
Given a large collection of images, very few of which have labels, how can we guess the labels of the remaining majority, and how can we spot the images that need brand-new labels, different from the existing ones? Current automatic labeling techniques usually scale superlinearly with the data size, and/or they fail when only a tiny amount of labeled data is provided. In this paper, we propose QMAS (Querying, Mining And Summarization of Multi-modal Databases), a fast solution to the following problems: (i) low-labor labeling (L3): given a collection of images, very few of which are labeled with keywords, find the most suitable labels for the remaining ones; and (ii) mining and attention routing: in the same setting, find clusters, the top-N_O outlier images, and the top-N_R representative images. We report experiments on real satellite images: two large sets (1.5 GB and 2.25 GB) of proprietary images and a smaller set (17 MB) of public images. We show that QMAS scales linearly with the data size and is up to 40 times faster than top competitors (GCap), while obtaining better or equal accuracy. In contrast to other methods, QMAS does low-labor labeling (L3); that is, it works even with tiny initial label sets. It also solves both of the problems presented above and spots tiles that potentially require new labels.
Year
2010
DOI
10.1109/ICDM.2010.150
Venue
ICDM
Keywords
large set, brand new label, tiny amount, qmas scales linearly, tiny initial label set, multi-modal databases, new label, large collection, data size, super linearly, remaining majority, pixel, clustering algorithms, satellites, labeling, data mining, feature extraction, image recognition, clustering
Field
Automatic summarization, Data mining, Computer science, Outlier, Feature extraction, Pixel, Labeled data, Cluster analysis, Modal, Database, Satellite image
DocType
Conference
Citations
3
PageRank
0.36
References
18
Authors
9
Name                  Order  Citations  PageRank
Robson L. F. Cordeiro 1      5          2.43
Fan Guo               2      438        18.96
Donna S. Haverkamp    3      23         1.89
James H. Horne        4      4          0.71
Ellen K. Hughes       5      260        33.52
Gunhee Kim            6      632        47.17
Agma J. M. Traina     7      1024       153.61
Caetano Traina Jr.    8      1052       137.26
Christos Faloutsos    9      27972      4490.38