Title
A comparison of dimensionality reduction methods using topology preservation indexes
Abstract
Due to the remarkable technological developments experienced in recent decades, the vast amount of data had created new opportunities and challenges in the field of knowledge discovery and data mining. Factors like size and high dimensionality of databases adds difficulties to the complex task of discovering patterns hidden in masses of data. The feasibility of highdimensional data exploration depends on techniques known as dimensionality reduction methods. When class labels are available, an optimization function can be used to maximize intra class cohesion and inter class separation. However, in many practical situations information about class is not available. This paper focuses on unsupervised dimensionality reduction techniques, an important phase in exploratory data analysis. Six important methods are described: Principal components analysis, Sammon projection, Autoassociative Neural network, Kohonen maps, Isomap and Locally Linear Embedding. Three quality indexes are proposed to try to quantify to some degree the topology preservation between input and output spaces. Comparisons are performed using benchmark data sets. Results and tests focused two-dimensional projections for data visualization purposes.
Year
DOI
Venue
2011
10.1007/978-3-642-23878-9_52
IDEAL
Keywords
Field
DocType
intelligent systems,data visualization,dimensionality reduction,neural networks,data mining
Sammon mapping,Data mining,Dimensionality reduction,Computer science,Self-organizing map,Artificial intelligence,Exploratory data analysis,Topology,Data visualization,Pattern recognition,Curse of dimensionality,Knowledge extraction,Machine learning,Isomap
Conference
Volume
ISSN
Citations 
6936
0302-9743
0
PageRank 
References 
Authors
0.34
7
3