Title
On bicluster aggregation and its benefits for enumerative solutions
Abstract
Biclustering involves the simultaneous clustering of objects and their attributes, thus defining local two-way clustering models. Recently, efficient algorithms were conceived to enumerate all biclusters in real-valued datasets. In this case, the solution composes a complete set of maximal and non-redundant biclusters. However, the ability to enumerate biclusters revealed a challenging scenario: in noisy datasets, each true bicluster may become highly fragmented and with a high degree of overlapping. It prevents a direct analysis of the obtained results. Aiming at reverting the fragmentation, we propose here two approaches for properly aggregating the whole set of enumerated biclusters: one based on single linkage and the other directly exploring the rate of overlapping. Both proposals were compared with each other and with the actual state-of-the-art in several experiments, and they not only significantly reduced the number of biclusters but also consistently increased the quality of the solution.
Year
DOI
Venue
2015
10.1007/978-3-319-21024-7_18
Machine Learning and Data Mining in Pattern Recognition
Keywords
Field
DocType
Biclustering,Bicluster enumeration,Bicluster aggregation,Outlier removal,Metrics for biclusters
Outlier removal,Data mining,Pattern recognition,Computer science,Artificial intelligence,Biclustering,Cluster analysis,Machine learning,Single Linkage
Journal
Volume
ISSN
Citations 
abs/1506.01077
0302-9743
1
PageRank 
References 
Authors
0.35
14
3