Title
Wrapper- and ensemble-based feature subset selection methods for biomarker discovery in targeted metabolomics
Abstract
The discovery of markers allowing for accurate classification of metabolically very similar proband groups constitutes a challenging problem. We apply several search heuristics combined with different classifier types to targeted metabolomics data to identify compound subsets that classify plasma samples of insulin sensitive and -resistant subjects, both suffering from non-alcoholic fatty liver disease. Additionally, we integrate these methods into an ensemble and screen selected subsets for common features. We investigate, which methods appear the most suitable for the task, and test feature subsets for robustness and reproducibility. Furthermore, we consider the predictive potential of different compound classes. We find that classifiers fail in discriminating the non-selected data accurately, but benefit considerably from feature subset selection. Especially, a Pareto-based multi-objective genetic algorithm detects highly discriminative subsets and outperforms widely used heuristics. When transferred to new data, feature sets assembled by the ensemble approach show greater robustness than those selected by single methods.
Year
DOI
Venue
2011
10.1007/978-3-642-24855-9_11
PRIB
Keywords
Field
DocType
compound subsets,common feature,different classifier type,discriminative subsets,targeted metabolomics data,feature subset selection,test feature subsets,biomarker discovery,new data,non-selected data,different compound class,ensemble-based feature subset selection
Hill climbing,Computer science,Robustness (computer science),Heuristics,Artificial intelligence,Classifier (linguistics),Discriminative model,Genetic algorithm,Pattern recognition,Bioinformatics,Biomarker discovery,Pareto principle,Machine learning
Conference
Volume
ISSN
Citations 
7036
0302-9743
1
PageRank 
References 
Authors
0.37
9
6
Name
Order
Citations
PageRank
Holger Franken140.77
Rainer Lehmann241.11
Hans-Ulrich Häring341.11
Andreas Fritsche410.37
Norbert Stefan510.37
Andreas Zell6328.40