Title
Greedy learning of latent tree models for multidimensional clustering
Abstract
Real-world data are often multifaceted and can be meaningfully clustered in more than one way. There is a growing interest in obtaining multiple partitions of data. In previous work we learnt from data a latent tree model (LTM) that contains multiple latent variables (Chen et al. 2012). Each latent variable represents a soft partition of data and hence multiple partitions result in. The LTM approach can, through model selection, automatically determine how many partitions there should be, what attributes define each partition, and how many clusters there should be for each partition. It has been shown to yield rich and meaningful clustering results.Our previous algorithm EAST for learning LTMs is only efficient enough to handle data sets with dozens of attributes. This paper proposes an algorithm called BI that can deal with data sets with hundreds of attributes. We empirically compare BI with EAST and other more efficient LTM learning algorithms, and show that BI outperforms its competitors on data sets with hundreds of attributes. In terms of clustering results, BI compares favorably with alternative methods that are not based on LTMs.
Year
DOI
Venue
2015
10.1007/s10994-013-5393-0
Machine Learning
Keywords
Field
DocType
Model-based clustering,Multiple partitions,Latent tree models
Data mining,Data set,Chen,Pattern recognition,Decision tree model,Model selection,Latent variable,Artificial intelligence,Cluster analysis,Partition (number theory),Machine learning,Mathematics
Journal
Volume
Issue
ISSN
98
1-2
0885-6125
Citations 
PageRank 
References 
9
0.66
25
Authors
6
Name
Order
Citations
PageRank
Tengfei Liu1927.09
Nevin .L Zhang289597.21
Peixian Chen3191.57
April Hua Liu490.66
Leonard K. M. Poon59410.96
Yi Wang6645.86