Title
Memetic Algorithms for De Novo Motif Discovery
Abstract
Identifying unknown transcription factor binding sites (TFBSs) is fundamental to understanding gene regulation and, more broadly, the mechanisms of life. In bioinformatics, the corresponding de novo motif discovery problem is formulated as pattern discovery from strings. Challenges arise in both modeling and optimization because the short TFBSs are weak signals in massive, noisy experimental data. While genetic algorithms have been widely applied to the problem, recent memetic algorithms (MAs), which employ local operators, have demonstrated superior effectiveness and efficiency. In this paper, we propose and study various MA components, including local operators and models for motif discovery, within a newly established MA framework. Their optimization and modeling capabilities are analyzed in depth on real datasets and on noisy versions of those datasets. Selected optimal MAs show significantly improved performance over state-of-the-art methods in extensive tests, including a blind test on a eukaryotic benchmark. This paper serves as the first systematic study of MAs for de novo motif discovery, and the analyses highlight important issues in MA design. The comprehensive component categorization and the MA framework provide a useful platform for future MA development, especially for newly emerging chromatin immunoprecipitation followed by sequencing (ChIP-seq) data.
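To make the idea of a genetic algorithm augmented with a local operator concrete, the following is a minimal sketch and not the framework proposed in the paper. It assumes a deliberately simple setting: each candidate solution is one motif start position per sequence, the score is the summed count of the most common base in each aligned column, and the local operator greedily re-places one site at a time. The function names (`score`, `local_search`, `memetic_search`), the scoring model, and all parameter defaults are illustrative assumptions.

```python
import random
from collections import Counter


def score(seqs, pos, w):
    """Consensus agreement: for each of the w aligned columns, count the most common base."""
    total = 0
    for j in range(w):
        column = Counter(s[p + j] for s, p in zip(seqs, pos))
        total += column.most_common(1)[0][1]
    return total


def local_search(seqs, pos, w):
    """Local operator: for each sequence in turn, move its site to the best-scoring start."""
    pos = list(pos)
    for i, s in enumerate(seqs):
        best_p, best_f = pos[i], score(seqs, pos, w)
        for p in range(len(s) - w + 1):
            pos[i] = p
            f = score(seqs, pos, w)
            if f > best_f:
                best_p, best_f = p, f
        pos[i] = best_p  # keep the best start found (never worse than the original)
    return pos


def memetic_search(seqs, w, pop_size=20, generations=15, seed=0):
    """Genetic search over site positions, refining every individual with local_search.

    Requires pop_size >= 4 so that at least two parents survive selection.
    """
    rng = random.Random(seed)

    def random_individual():
        return [rng.randrange(len(s) - w + 1) for s in seqs]

    pop = [local_search(seqs, random_individual(), w) for _ in range(pop_size)]
    for _ in range(generations):
        # Truncation selection: keep the better half as parents.
        pop.sort(key=lambda ind: score(seqs, ind, w), reverse=True)
        parents = pop[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            i = rng.randrange(len(child))                      # mutate one site
            child[i] = rng.randrange(len(seqs[i]) - w + 1)
            children.append(local_search(seqs, child, w))      # memetic refinement
        pop = parents + children
    best = max(pop, key=lambda ind: score(seqs, ind, w))
    return best, score(seqs, best, w)
```

Calling `memetic_search(seqs, w)` on a list of DNA strings returns a list of start positions and its score. Removing the two `local_search` calls reduces the sketch to a plain genetic algorithm, which is one way to see what the local operator contributes.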
Year: 2012
DOI: 10.1109/TEVC.2011.2171972
Venue: IEEE Trans. Evolutionary Computation
Keywords: genetic algorithms, gene expression, dna, memetics, memetic algorithms, benchmark testing, bioinformatics, genetics, data handling, optimization, evolutionary computation
Field: Memetic algorithm, DNA binding site, Evolutionary computation, Motif (music), Artificial intelligence, Memetics, Group method of data handling, Machine learning, Benchmark (computing), Genetic algorithm, Mathematics
DocType: Journal
Volume: 16
Issue: 5
ISSN: 1089-778X
Citations: 4
PageRank: 0.40
References: 20
Authors (3)
Name             Order  Citations  PageRank
Tak-ming Chan    1      190        13.57
Kwong-Sak Leung  2      1887       205.58
Kin-Hong Lee     3      257        26.27