Title
Unsupervised Ranking And Characterization Of Differentiated Clusters
Abstract
We describe a framework for automatically identifying and visualizing the most differentiating attributes of each cluster in a clustered data set. A dissimilarity function measures the cluster-conditional distinguishing saliency of each attribute with respect to a reference realization of the same attribute. For each cluster, the N attributes that are most dissimilar are presented first to the human expert, along with the overall dissimilarity of the cluster. We discuss the computational benefits of the proposed framework, how it can be implemented with map-reduce, its application to the behavioral analysis of mobile phone users, and it broad applicability to diverse problem domains.
Year
DOI
Venue
2013
10.1109/ISI.2013.6578834
2013 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: BIG DATA, EMERGENT THREATS, AND DECISION-MAKING IN SECURITY INFORMATICS
Keywords
Field
DocType
clustering, dissimilarity, KL divergence, map-reduce
Data mining,Cluster (physics),Ranking,Salience (neuroscience),Pattern clustering,Computer science,Artificial intelligence,Behavioral analysis,Mobile phone,Machine learning
Conference
Citations 
PageRank 
References 
0
0.34
0
Authors
5
Name
Order
Citations
PageRank
Luca Cazzanti11929.49
Courosh Mehanian201.01
Julie Penzotti300.34
Doug Scott400.34
Oliver Downs500.34