Title
Evolving Transcription Factor Binding Site Models From Protein Binding Microarray Data
Abstract
Protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner. In this paper, we describe the PBM motif model building problem. We apply several evolutionary computation methods and compare their performance with the interior point method, demonstrating their performance advantages. In addition, given the PBM domain knowledge, we propose and describe a novel method called kmerGA which makes domain-specific assumptions to exploit PBM data properties to build more accurate models than the other models built. The effectiveness and robustness of kmerGA is supported by comprehensive performance benchmarking on more than 200 datasets, time complexity analysis, convergence analysis, parameter analysis, and case studies. To demonstrate its utility further, kmerGA is applied to two real world applications: 1) PBM rotation testing and 2) ChIP-Seq peak sequence prediction. The results support the biological relevance of the models learned by kmerGA, and thus its real world applicability.
Year
DOI
Venue
2017
10.1109/TCYB.2016.2519380
IEEE Trans. Cybernetics
Keywords
Field
DocType
Genetic algorithm,motif discovery,protein binding microarray (PBM),transcription factor (TF) binding site.
Data mining,Data modeling,DNA binding site,Domain knowledge,Computer science,Model building,Evolutionary computation,Robustness (computer science),Artificial intelligence,Time complexity,Genetic algorithm,Machine learning
Journal
Volume
Issue
ISSN
PP
99
2168-2267
Citations 
PageRank 
References 
1
0.35
0
Authors
3
Name
Order
Citations
PageRank
Ka-Chun Wong129140.18
Chengbin Peng2202.50
Yue Li330.76