Title
On the Consistency of k-means++ algorithm.
Abstract
We prove in this paper that the expected value of the objective function of the $k$-means++ algorithm for samples converges to population expected value. As $k$-means++, for samples, provides with constant factor approximation for $k$-means objectives, such an approximation can be achieved for the population with increase of the sample size. This result is of potential practical relevance when one is considering using subsampling when clustering large data sets (large data bases).
Year
DOI
Venue
2017
10.3233/fi-2020-1909
arXiv: Learning
Field
DocType
Volume
Population,k-means clustering,Mathematical optimization,Data set,Algorithm,Expected value,Cluster analysis,Sample size determination,Mathematics
Journal
abs/1702.06120
Issue
Citations 
PageRank 
4
0
0.34
References 
Authors
2
1
Name
Order
Citations
PageRank
Mieczyslaw A. Klopotek136678.58