Title
A Novel Scalable Algorithm for Supervised Subspace Learning
Abstract
Subspace learning approaches aim to discover important statistical distribution on lower dimensions for high dimensional data. Methods such as Principal Component Analysis (PCA) do not make use of the class information, and Linear Discriminant Analysis (LDA) could not be performed efficiently in a scalable way. In this paper, we propose a novel highly scalable supervised subspace learning algorithm called as Supervised Kampong Measure (SKM). It assigns data points as close as possible to their corresponding class mean, simultaneously assigns data points to be as far as possible from the other class means in the transformed lower dimensional subspace. Theoretical derivation shows that our algorithm is not limited by the number of classes or the singularity problem faced by LDA. Furthermore, our algorithm can be executed in an incremental manner in which learning is done in an online fashion as data streams are received. Experimental results on several datasets, including a very large text data set RCV1, show the outstanding performance of our proposed algorithm on classification problems as compared to PCA, LDA and a popular feature selection approach, Information Gain (IG).
Year
DOI
Venue
2006
10.1109/ICDM.2006.7
ICDM
Keywords
Field
DocType
novel scalable algorithm,linear discriminant analysis,large text data,scalable supervised subspace,data point,class information,corresponding class,high dimensional data,dimensional subspace,proposed algorithm,supervised subspace learning,data stream,principal component analysis,statistical distributions,statistical distribution,learning artificial intelligence,feature selection,information gain
Data mining,Data stream mining,Feature selection,Computer science,Probability distribution,Artificial intelligence,Data point,Clustering high-dimensional data,Pattern recognition,Subspace topology,Linear discriminant analysis,Machine learning,Principal component analysis
Conference
ISSN
ISBN
Citations 
1550-4786
0-7695-2701-9
9
PageRank 
References 
Authors
0.59
12
6
Name
Order
Citations
PageRank
Jun Yan1179885.25
Ning Liu225315.62
Benyu Zhang3213590.41
Qiang Yang417039875.69
Shuicheng Yan51056.20
Zheng Chen65019256.89