Feature Selection for Unsupervised Learning via Comparison of Distance Matrices - Citegraph

Paper Info

Title
Feature Selection for Unsupervised Learning via Comparison of Distance Matrices

Abstract
Feature selection for unsupervised learning is generally harder than for supervised learning, because the former lacks the class information of the latter, and thus an obvious way by which to measure the quality of a feature subset. In this paper, we propose a new method based on representing data sets by their distance matrices, and judging feature combinations by how well the distance matrix using only these features resembles the distance matrix of the full data set. Using articial data for which the relevant features were known, we observed that the results depend on the data dimensionality, the fraction of relevant features, the overlap between clusters in the relevant feature subspaces, and how to measure the similarity of distance matrices. Our method consistently achieved higher than 80% detection rates of relevant features for a wide variety of experimental configurations.

Year	DOI	Venue
2013	10.1007/978-3-642-53856-8_26	EUROCAST (1)
Keywords	Field	DocType
Unsupervised feature selection, feature extraction, dimensionality reduction, distance matrix similarity	Dimensionality reduction,Feature selection,Distance matrices in phylogeny,Pattern recognition,Feature (computer vision),Computer science,Feature extraction,Supervised learning,Unsupervised learning,Artificial intelligence,Distance matrix,Machine learning	Conference
Volume	ISSN	Citations
8111	0302-9743	0
PageRank	References	Authors
0.34	8	1

Authors (1 rows)

Cited by (0 rows)

References (8 rows)

Name	Order	Citations	PageRank
Stephan Dreiseitl	1	338	34.80

1