Title
Pattern-based feature selection in genomics and proteomics
Abstract
A major difficulty in bioinformatics is due to the size of the datasets, which contain frequently large numbers of variables. In this study, we present a two-step procedure for feature selection. In a first “filtering” stage, a relatively small subset of features is identified on the basis of several criteria. In the second stage, the importance of the selected variables is evaluated based on the frequency of their participation in relevant patterns and low impact variables are eliminated. This step is applied iteratively, until arriving to a Pareto-optimal “support set”, which balances the conflicting criteria of simplicity and accuracy.
Year
DOI
Venue
2006
10.1007/s10479-006-0084-x
Annals OR
Keywords
Field
DocType
Feature selection,Genomics,Proteomics,Logical analysis of data,LAD,Patterns
Data mining,Feature selection,Proteomics,Logical analysis of data,Filter (signal processing),Genomics,Mathematics
Journal
Volume
Issue
ISSN
148
1
0254-5330
Citations 
PageRank 
References 
15
0.93
9
Authors
4
Name
Order
Citations
PageRank
Gabriela Alexe119712.75
sorin alexe216910.56
Peter L. Hammer31996288.93
Béla Vizvári4779.40