Title
An integrated approach of particle swarm optimization and support vector machine for gene signature selection and cancer prediction
Abstract
To improve cancer diagnosis and drug development, the classification of tumor types based on genomic information is important. As DNA microarray studies produce a large amount of data, expression data are highly redundant and noisy, and most genes are believed to be uninformative with respect to the studied classes. Only a fraction of genes may present distinct profiles for different classes of samples. Classification tools to deal with these issues are thus important. These tools should learn to robustly identify a subset of informative genes embedded in a large dataset that is contaminated with high dimensional noises. In this paper, an integrated approach of support vector machine (SVM) and particle swarm optimization (PSO) is proposed for this purpose. The proposed approach can simultaneously optimize the selection of feature subset and the classifier through a common solution coding mechanism. As an illustration, the proposed approach is applied to search the combinational gene signatures for predicting histologic response to chemotherapy of osteosarcoma patients. Crossvalidation results show that the proposed approach outperforms other existing methods in terms of classification accuracy. Further validation using an independent dataset shows misclassification of only one out of fourteen patient samples, suggesting that the selected gene signatures can reflect the chemoresistance in osteosarcoma.
Year
DOI
Venue
2009
10.1109/IJCNN.2009.5178827
IJCNN
Keywords
Field
DocType
particle swarm optimization,support vector machine,informative gene,integrated approach,classification tool,independent dataset,feature subset,classification accuracy,gene signature selection,combinational gene signature,large amount,expression data,cancer prediction,bioinformatics,classification algorithms,cancer,dna,digital signatures,surgery,genomics,support vector machines,cross validation,dna microarray,drug development,lab on a chip,kernel
Data mining,Computer science,Digital signature,Artificial intelligence,Classifier (linguistics),Particle swarm optimization,Kernel (linear algebra),Pattern recognition,Support vector machine,Statistical classification,Gene signature,DNA microarray,Machine learning
Conference
ISSN
Citations 
PageRank 
2161-4393
3
0.68
References 
Authors
15
4
Name
Order
Citations
PageRank
C. W. Yeung11076.74
F. H. F. Leung261633.93
K. Y. Chan3784.77
S. H. Ling460940.29