Title
Self-learning K-means clustering: a global optimization approach
Abstract
An appropriate distance is an essential ingredient in various real-world learning tasks. Distance metric learning proposes to study a metric, which is capable of reflecting the data configuration much better in comparison with the commonly used methods. We offer an algorithm for simultaneous learning the Mahalanobis like distance and K-means clustering aiming to incorporate data rescaling and clustering so that the data separability grows iteratively in the rescaled space with its sequential clustering. At each step of the algorithm execution, a global optimization problem is resolved in order to minimize the cluster distortions resting upon the current cluster configuration. The obtained weight matrix can also be used as a cluster validation characteristic. Namely, closeness of such matrices learned during a sample process can indicate the clusters readiness; i.e. estimates the true number of clusters. Numerical experiments performed on synthetic and on real datasets verify the high reliability of the proposed method.
Year
DOI
Venue
2013
10.1007/s10898-012-9854-y
J. Global Optimization
Keywords
Field
DocType
clusters readiness,cluster validation characteristic,cluster distortion,global optimization approach,current cluster configuration,metric learning,algorithm execution,self-learning k-means clustering,data configuration,data separability,sequential clustering,appropriate distance
Hierarchical clustering,k-medians clustering,Fuzzy clustering,Mathematical optimization,CURE data clustering algorithm,Pattern recognition,Correlation clustering,Artificial intelligence,Constrained clustering,Cluster analysis,Mathematics,Single-linkage clustering
Journal
Volume
Issue
ISSN
56
2
0925-5001
Citations 
PageRank 
References 
1
0.35
11
Authors
3
Name
Order
Citations
PageRank
Z. Volkovich17413.19
D. Toledano-Kitai2122.25
gerhard weber3294.10