Title
Multi-marker tagging single nucleotide polymorphism selection using estimation of distribution algorithms.
Abstract
This paper presents an optimization algorithm for the automatic selection of a minimal subset of tagging single nucleotide polymorphisms (SNPs). The determination of the minimal set of tagging SNPs is approached as an optimization problem in which each tagged SNP can be covered by a single tagging SNP or by a pair of tagging SNPs. The problem is solved using an estimation of distribution algorithm (EDA), which takes advantage of the underlying topological structure defined by the SNP correlations to model the problem interactions. The EDA stochastically searches the constrained space of feasible solutions. It is evaluated across HapMap reference panel data sets. The EDA was compared with a SAT solver, which is able to find the single-marker minimal tagging sets, and with the Tagger program. The minimal multi-marker tagging sets found by the EDA contained 10% to 43% fewer tagging SNPs than the sets found by the other algorithms. The introduced algorithm is effective for the identification of minimal multi-marker SNP sets, which considerably reduce the dimension of the tagging SNP set in comparison with single-marker sets. Other variants of the SNP problem can be treated following the same approach.
Year: 2010
DOI: 10.1016/j.artmed.2010.05.010
Venue: Artificial Intelligence In Medicine
Keywords: minimal subset, minimal tagging snps, minimal multi-marker, snp correlation, hapmap, multi-marker selection, tagging snps, single nucleotide polymorphism selection, tagging snp, minimal multi-marker snp set, single tagging snp, tagging single nucleotide polymorphism selection, distribution algorithm, snp problem, estimation of distribution algorithms, single-marker minimal tagging set, single nucleotide polymorphism, estimation of distribution algorithm
Field: Data mining, Tagging SNP, Estimation of distribution algorithm, Computer science, International HapMap Project, Boolean satisfiability problem, Optimization algorithm, Single-nucleotide polymorphism, SNP, Optimization problem
DocType: Journal
Volume: 50
Issue: 3
ISSN: 1873-2860
Citations: 4
PageRank: 0.47
References: 22
Authors: 5
Name                 Order  Citations  PageRank
Roberto Santana      1      357        19.04
Alexander Mendiburu  2      355        33.61
Noah Zaitlen         3      67         9.54
Eleazar Eskin        4      1790       170.53
José A. Lozano       5      2148       167.25