Abstract | ||
---|---|---|
Because of measurements obtained under limited experimental conditions or time points compared to the presence of many genes, also known as the "large dimension, small sample size" problem, dimensionality reduction techniques are a common practice in statistical bioinformatics involving microarray analysis. However, in order to improve the performance of reverse engineering and statistical inference procedures aimed to estimate gene-gene connectivity links, some kind of regularization is usually needed to reduce the overall data complexities, together with ad hoc feature selection to uncover biologically relevant gene associations. The paper deals with feature selection by projective methods; in particular, it addresses some issues: Can the impact of noise on the data be limited by shrinkage or de-noising? How can complexity from convoluted dynamics associated with microarray measurements be discounted? In modeling such data, how to deal with over-parametrization, and control it? The problem of aliasing is then discussed and classified into two categories according to the trade-off between biological relevance and noise, and finally reported in analytical form via subspace analysis. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1142/S0219720009004254 | J. Bioinformatics and Computational Biology |
Keywords | Field | DocType |
projection method,aliasing,feature detection | Data mining,Dimensionality reduction,Feature selection,Regularization (mathematics),Artificial intelligence,Statistical inference,Projective test,Reverse engineering,Aliasing,Bioinformatics,Sample size determination,Machine learning,Mathematics | Journal |
Volume | Issue | ISSN |
7 | 4 | 0219-7200 |
Citations | PageRank | References |
0 | 0.34 | 8 |
Authors | ||
1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Enrico Capobianco | 1 | 22 | 7.39 |