Title
Asymmetric Bagging and Feature Selection for Activities Prediction of Drug Molecules
Abstract
Activities of drug molecules can be predicted by QSAR (quantitative structure activity relationship) models, which overcomes the disadvantages of high cost and long cycle by employing the traditional experimental method. With the fact that the number of drug molecules with positive activity is rather fewer than that of negatives, it is important to predict molecular activities considering such an unbalanced situation.Here, asymmetric bagging and feature selection are introduced into the problem and asymmetric bagging of support vector machines (asBagging) is proposed on predicting drug activities to treat the unbalanced problem. At the same time, the features extracted from the structures of drug molecules affect prediction accuracy of QSAR models. Therefore, a novel algorithm named PRIFEAB is proposed, which applies an embedded feature selection method to remove redundant and irrelevant features for asBagging. Numerical experimental results on a data set of molecular activities show that asBagging improve the AUC and sensitivity values of molecular activities and PRIFEAB with feature selection further helps to improve the prediction ability.Asymmetric bagging can help to improve prediction accuracy of activities of drug molecules, which can be furthermore improved by performing feature selection to select relevant features from the drug molecules data sets.
Year
DOI
Venue
2007
10.1186/1471-2105-9-S6-S7
BMC Bioinformatics
Keywords
Field
DocType
feature selection,drug molecule,activities prediction,asymmetric bagging,drug molecules aspects prediction,molecular activity,embedded feature selection method,irrelevant feature,qsar model,drug molecules,drug activity,numerical experimental result,computer simulation,feature extraction,microarrays,algorithms,bioinformatics,quantitative structure activity relationship,support vector machine,artificial intelligence
Data mining,Quantitative structure–activity relationship,Feature selection,Computer science,Artificial intelligence,Bioinformatics,Machine learning
Conference
Volume
Issue
ISSN
9
SUPPL. 6
null
ISBN
Citations 
PageRank 
0-7695-3039-7
52
1.27
References 
Authors
25
5
Name
Order
Citations
PageRank
Guo-Zheng Li136842.62
Hao-Hua Meng2521.27
Wen-Cong Lu3654.40
Jack Y. Yang4902175.51
Mary Qu Yang5933191.35