Title | ||
---|---|---|
Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters. |
Abstract | ||
---|---|---|
Clustering is an important technique for identifying groups of similar data objects within a data set. Since problems during the data collection and data preprocessing steps often lead to missing values in the data sets, there is a need for clustering methods that can deal with such imperfect data. Approaches proposed in the literature for adapting the fuzzy c-means algorithm to incomplete data work well on data sets with equally sized and shaped clusters. In this paper we present an approach for adapting the fuzzy c-means algorithm to incomplete data that uses the dimension-wise fuzzy variances of clusters for imputation of missing values. In experiments on incomplete real and synthetic data sets with differently sized and shaped clusters, we demonstrate the benefit over the basic approach in terms of the assignment of data objects to clusters and the cluster prototype computation. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1007/978-3-319-40596-4_58 | Communications in Computer and Information Science |
Keywords | Field | DocType |
Clustering,Fuzzy c-means (FCM),Incomplete data,Missing values | Fuzzy clustering,Data mining,CURE data clustering algorithm,Fuzzy classification,Correlation clustering,Computer science,Data pre-processing,FLAME clustering,Missing data,Cluster analysis | Conference |
Volume | ISSN | Citations |
610 | 1865-0929 | 0 |
PageRank | References | Authors |
0.34 | 5 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ludmila Himmelspach | 1 | 24 | 4.62 |
Stefan Conrad | 2 | 168 | 105.91 |