Initialization Dependence of Clustering Algorithms - Citegraph

Paper Info

Title
Initialization Dependence of Clustering Algorithms

Abstract
It is well known that the clusters produced by a clustering algorithm depend on the chosen initial centers. In this paper we present a measure for the degree to which a given clustering algorithm depends on the choice of initial centers, for a given data set. This measure is calculated for four well-known offline clustering algorithms (k-means Forgy, k-means Hartigan, k-means Lloyd and fuzzy c-means), for five benchmark data sets. The measure is also calculated for ECM, an online algorithm that does not require the number of initial centers as input, but for which the resulting clusters can depend on the order that the input arrives. Our main finding is that this initialization dependence measure can also be used to determine the optimal number of clusters.

Year	DOI	Venue
2008	10.1007/978-3-642-03040-6_75	ICONIP
Keywords	DocType	Volume
k means,online algorithm	Conference	5507
ISSN	Citations	PageRank
0302-9743	1	0.37
References	Authors
7	4

Authors (4 rows)

Cited by (1 rows)

References (7 rows)

Name	Order	Citations	PageRank
Wim De Mulder	1	1	0.37
Stefan Schliebs	2	380	18.56
René Boel	3	56	5.58
Martin Kuiper	4	521	32.47

1