Title
Adaptive discriminant analysis for microarray-based classification
Abstract
Microarray technology has generated enormous amounts of high-dimensional gene expression data, providing a unique platform for exploring gene regulatory networks. However, the curse of dimensionality plagues effort to analyze these high throughput data. Linear Discriminant Analysis (LDA) and Biased Discriminant Analysis (BDA) are two popular techniques for dimension reduction, which pay attention to different roles of the positive and negative samples in finding discriminating subspace. However, the drawbacks of these two methods are obvious: LDA has limited efficiency in classifying sample data from subclasses with different distributions, and BDA does not account for the underlying distribution of negative samples. In this paper, we propose a novel dimension reduction technique for microarray analysis: Adaptive Discriminant Analysis (ADA), which effectively exploits favorable attributes of both BDA and LDA and avoids their unfavorable ones. ADA can find a good discriminative subspace with adaptation to different sample distributions. It not only alleviates the problem of high dimensionality, but also enhances the classification performance in the subspace with naïve Bayes classifier. To learn the best model fitting the real scenario, boosted Adaptive Discriminant Analysis is further proposed. Extensive experiments on the yeast cell cycle regulation data set, and the expression data of the red blood cell cycle in malaria parasite Plasmodium falciparum demonstrate the superior performance of ADA and boosted ADA. We also present some putative genes of specific functional classes predicted by boosted ADA. Their potential functionality is confirmed by independent predictions based on Gene Ontology, demonstrating that ADA and boosted ADA are effective dimension reduction methods for microarray-based classification.
Year
DOI
Venue
2008
10.1145/1342320.1342325
TKDD
Keywords
Field
DocType
linear discriminant analysis,biased discriminant analysis,different distribution,expression data,microarray-based classification,high throughput data,adaptive discriminant analysis,lda,classifying sample data,ada,different role,dimension reduction,bda,high-dimensional gene expression data,microarray,boosted ada,negative sample,microarray analysis,discriminant analysis,high throughput,gene regulatory network,curse of dimensionality,bayes classifier,cell cycle regulation
Effective dimension,Data mining,Dimensionality reduction,Subspace topology,Naive Bayes classifier,Computer science,Curse of dimensionality,Artificial intelligence,Gene chip analysis,Linear discriminant analysis,Discriminative model,Machine learning
Journal
Volume
Issue
Citations 
2
1
1
PageRank 
References 
Authors
0.35
6
5
Name
Order
Citations
PageRank
Yijuan Lu173246.24
Qi Tian26443331.75
Jennifer L. Neary331.74
Feng Liu430.72
Yufeng Wang55114.37