Title | ||
---|---|---|
Multi-marker tagging single nucleotide polymorphism selection using estimation of distribution algorithms. |
Abstract | ||
---|---|---|
This paper presents an optimization algorithm for the automatic selection of a minimal subset of tagging single nucleotide polymorphisms (SNPs).The determination of the set of minimal tagging SNPs is approached as an optimization problem in which each tagged SNP can be covered by a single tagging SNP or by a pair of tagging SNPs. The problem is solved using an estimation of distribution algorithm (EDA) which takes advantage of the underlying topological structure defined by the SNP correlations to model the problem interactions. The EDA stochastically searches the constrained space of feasible solutions. It is evaluated across HapMap reference panel data sets.The EDA was compared with a SAT solver, able to find the single-marker minimal tagging sets, and with the Tagger program. The percentage of reduction ranged from 10% to 43% in the number of tagging SNPs of the minimal multi-marker tagging set found by the EDA with respect to the other algorithms.The introduced algorithm is effective for the identification of minimal multi-marker SNP sets, which considerably reduce the dimension of the tagging SNP set in comparison with single-marker sets. Other variants of the SNP problem can be treated following the same approach. |
Year | DOI | Venue |
---|---|---|
2010 | 10.1016/j.artmed.2010.05.010 | Artificial Intelligence In Medicine |
Keywords | Field | DocType |
minimal subset,minimal tagging snps,minimal multi-marker,snp correlation,hapmap,multi-marker selection,tagging snps,single nucleotide polymorphism selection,tagging snp,minimal multi-marker snp set,single tagging snp,tagging single nucleotide polymorphism selection,distribution algorithm,snp problem,estimation of distribution algorithms,single-marker minimal tagging set,single nucleotide polymorphism,estimation of distribution algorithm | Data mining,Tagging SNP,Estimation of distribution algorithm,Computer science,International HapMap Project,Boolean satisfiability problem,Optimization algorithm,Single-nucleotide polymorphism,SNP,Optimization problem | Journal |
Volume | Issue | ISSN |
50 | 3 | 1873-2860 |
Citations | PageRank | References |
4 | 0.47 | 22 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Roberto Santana | 1 | 357 | 19.04 |
Alexander Mendiburu | 2 | 355 | 33.61 |
Noah Zaitlen | 3 | 67 | 9.54 |
Eleazar Eskin | 4 | 1790 | 170.53 |
José A. Lozano | 5 | 2148 | 167.25 |