Variable Selection in Model-Based Clustering: To Do or To Facilitate - Citegraph

Paper Info

Title
Variable Selection in Model-Based Clustering: To Do or To Facilitate

Abstract
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only from the lack of class information but also the fact that high-dimensional data are often multifaceted and can be meaningfully clustered in multiple ways. In such a case the effort to find one subset of attributes that presumably gives the "best" clustering may be misguided. It makes more sense to facilitate variable selection by domain experts, that is, to systematically identify various facets of a data set (each being based on a subset of attributes), cluster the data along each one, and present the results to the domain experts for appraisal and selection. In this paper, we propose a generalization of the Gaussian mixture model, show its ability to cluster data along multiple facets, and demonstrate it is often more reasonable to facilitate variable selection than to perform it. Copyright 2010 by the author(s)/owner(s).

Year	DOI	Venue
2010	null	ICML
Keywords	Field	DocType
variable selection	Data mining,Pattern recognition,Feature selection,Computer science,Artificial intelligence,Cluster analysis,Machine learning,Mixture model	Conference
Volume	Issue	ISSN
null	null	null
Citations	PageRank	References
13	0.62	10
Authors
4

Authors (4 rows)

Cited by (13 rows)

References (10 rows)

Name	Order	Citations	PageRank
Leonard K. M. Poon	1	94	10.96
Nevin .L Zhang	2	895	97.21
Tao Chen	3	76	7.04
Yi Wang	4	64	5.86

1