Title
A modular eigen subspace scheme for high-dimensional data classification
Abstract
In this paper, a novel filter-based greedy modular subspace (GMS) technique is proposed to improve the accuracy of high-dimensional data classification. The proposed approach initially divides the whole set of high-dimensional features into several arbitrary number of highly correlated subgroups by performing a greedy correlation matrix reordering transformation for each class. These GMS can be treated as not only a preprocess of GMS filter-based classifiers but also a unique feature extractor to generate a particular feature subspaces for each different class presented in high-dimensional data. The similarity measures are next calculated by projecting the samples into different modular feature subspaces. Finally, a GMS filter-based architecture based on the mean absolute errors criterion is adopted to build a non-linear multi-class classifier. The proposed GMS filter-based classification scheme is developed to find non-linear boundaries of different classes for high-dimensional data. It not only significantly improves the classification accuracy but also dramatically reduces the computational complexity of feature extraction compared with the conventional principal components analysis. Experimental results demonstrate that the proposed GMS feature extraction method suits the GMS filter-based classifier best as a classification preprocess. It significantly improves the precision of high-dimensional data classification.
Year
DOI
Venue
2004
10.1016/j.future.2003.11.003
Future Generation Comp. Syst.
Keywords
Field
DocType
mean absolute errors,high-dimensional data,different modular feature subspaces,principal components analysis,greedy modular subspaces,classification accuracy,proposed gms filter-based classification,high-dimensional data classification,proposed gms feature extraction,gms filter-based architecture,classification preprocess,gms filter-based classifier,different class,modular eigen subspace scheme,principal component analysis,high dimensional data,mean absolute error,feature extraction,computational complexity,correlation matrix
Clustering high-dimensional data,Subspace topology,Pattern recognition,Computer science,Feature extraction,Artificial intelligence,Covariance matrix,Data classification,Classifier (linguistics),Principal component analysis,Computational complexity theory
Journal
Volume
Issue
ISSN
20
7
Future Generation Computer Systems
Citations 
PageRank 
References 
5
0.47
14
Authors
6
Name
Order
Citations
PageRank
Yang-Lang Chang129335.04
Chin-Chuan Han266862.34
Fan-Di Jou3462.95
Kuo-chin Fan41369117.82
K. S. Chen581.79
Jeng-Horng Chang6573.30