Title
Global optimization, meta clustering and consensus clustering for class prediction
Abstract
Clustering of real-world data is often ill-posed. Because of noise and intrinsic ambiguity in data, optimization models attempting to maximize a fitness function can be misled by the assumption of uniqueness of the solution. In this work we present a methodology including classic and novel techniques to approach clustering in a systematic way, with two application examples to biological data sets. The methodology is based on a process that generates multiple clustering solutions (using global optimization), performs cluster analysis on such clusterings (i.e. Meta Clustering) and analyzes the obtained clusterings by the appropriate application of different consensus techniques. In order to validate the method, we seek for the solutions that best match the real class labels, exploiting only a random sample of them. Finally, we guess the class labels of the remaining patterns using cluster enrichment information and verify the percentage of correct assignments for each class. The optimization of clustering objective functions together with the use of partial labeling puts the described approach in between unsupervised and semi-supervised methods.
Year
DOI
Venue
2009
10.1109/IJCNN.2009.5178789
IJCNN
Keywords
Field
DocType
real-world data,biological data set,class prediction,meta clustering,global optimization,optimization model,class label,clustering objective function,real class label,multiple clustering solution,application example,clustering algorithms,data models,random sample,cluster analysis,optimization,stochastic processes,fitness function,probability density function,data mining,cognition,frequency,grounding,objective function,biological data,random sampling,stochastic resonance,noise,genetics,neural networks,random processes,sampling methods,consensus clustering
Canopy clustering algorithm,Fuzzy clustering,Data mining,CURE data clustering algorithm,Data stream clustering,Correlation clustering,Computer science,Consensus clustering,Artificial intelligence,Constrained clustering,Cluster analysis,Machine learning
Conference
ISSN
Citations 
PageRank 
2161-4393
2
0.40
References 
Authors
16
5
Name
Order
Citations
PageRank
Ida Bifulco1163.54
Carmine Fedullo271.57
Francesco Napolitano3615.16
Giancarlo Raiconi411815.08
Roberto Tagliaferri542855.64