Title
A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data
Abstract
Fuzzy co-clustering is a technique that performs simultaneous fuzzy clustering of objects and features. It is known to be suitable for categorizing high-dimensional data, because simultaneous feature clustering provides a dynamic dimensionality reduction mechanism. We introduce a new fuzzy co-clustering algorithm called Heuristic Fuzzy Co-clustering with the Ruspini's condition (HFCR), which addresses several issues in some prominent existing fuzzy co-clustering algorithms. These issues include their performance on data sets with overlapping feature clusters and their unnatural representation of feature clusters. The key idea behind HFCR is the formulation of the dual-partitioning approach for fuzzy co-clustering, which replaces the existing partitioning-ranking approach. HFCR adopts an efficient and practical heuristic method that can be shown to be more robust than our earlier effort for the dual-partitioning approach. We explain the proposed algorithm in detail and provide an analytical study of its advantages. Experimental results on 10 large benchmark document data sets confirm the effectiveness of the new algorithm.
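The difference between the two approaches named in the abstract can be illustrated with the membership constraints alone. The sketch below is not the HFCR algorithm. It assumes the usual formulation in which Ruspini's condition means that an item's memberships sum to 1 across clusters, and in which the partitioning-ranking approach instead normalizes feature memberships over the features within each cluster. All function and variable names are hypothetical, and the memberships are random placeholders rather than learned values.

```python
import random


def normalize_rows(matrix):
    """Scale each row so that its entries sum to 1."""
    return [[value / sum(row) for value in row] for row in matrix]


def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(column) for column in zip(*matrix)]


def random_positive(rows, cols, rng):
    """Random strictly positive matrix used as placeholder memberships."""
    return [[rng.random() + 1e-9 for _ in range(cols)] for _ in range(rows)]


def satisfies_ruspini(memberships, tol=1e-9):
    """True if every row (one item) sums to 1 across clusters."""
    return all(abs(sum(row) - 1.0) <= tol for row in memberships)


rng = random.Random(0)
n_objects, n_features, n_clusters = 5, 4, 3

# Object memberships u[i][c]: each object is partitioned across clusters.
u = normalize_rows(random_positive(n_objects, n_clusters, rng))

# Partitioning-ranking (assumed form): within each cluster, memberships are
# normalized over the features, so they rank features inside that cluster.
v_rank = transpose(normalize_rows(random_positive(n_clusters, n_features, rng)))

# Dual-partitioning (assumed form): each feature is partitioned across
# clusters, so feature memberships obey the same condition as objects.
v_dual = normalize_rows(random_positive(n_features, n_clusters, rng))

print("objects satisfy Ruspini:", satisfies_ruspini(u))
print("ranking features satisfy Ruspini:", satisfies_ruspini(v_rank))
print("dual-partitioning features satisfy Ruspini:", satisfies_ruspini(v_dual))
```

Under these assumptions, the ranking-style feature memberships cannot all sum to 1 per feature whenever the number of features differs from the number of clusters, because their grand total equals the number of clusters. A feature shared by several clusters therefore has no direct per-feature reading of how it is split among them.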
Year: 2008
DOI: 10.1016/j.fss.2007.10.003
Venue: Fuzzy Sets and Systems
Keywords: dual-partitioning approach, information retrieval, high-dimensional data, fuzzy clustering, feature cluster, existing fuzzy co-clustering algorithm, new fuzzy co-clustering algorithm, co-clustering, simultaneous fuzzy clustering, fuzzy co-clustering, large benchmark document data, existing partitioning-ranking approach, heuristic-based fuzzy co-clustering algorithm, high dimensional data, co clustering
Field: Fuzzy clustering, Data mining, Clustering high-dimensional data, Defuzzification, Fuzzy classification, Fuzzy set operations, Fuzzy logic, Algorithm, Fuzzy set, Cluster analysis, Mathematics
DocType: Journal
Volume: 159
Issue: 4
Citations: 14
PageRank: 0.74
References: 11
Authors (2)
Name                    Order   Citations   PageRank
William-Chandra Tjhi    1       156         10.09
Lihui Chen              2       380         27.30