Title
Discriminative Detection of Cis-Acting Regulatory Variation From Location Data
Abstract
The interaction between transcription factors and their DNA binding sites plays a key role for understanding gene regulation mechanisms. Recent studies revealed the presence of "functional polymorphism" in genes that is defined as regulatory variation measured in transcription levels due to the cis-acting sequence differences. These regulatory variants are assumed to contribute to modulating gene functions. However, computational identifications of such functional cis-regulatory variants is a much greater challenge than just identifying consensus sequences, because cis-regulatory variants differ by only a few bases from the main consensus sequences, while they have important consequences for organismal phenotype. None of the previous studies have directly addressed this problem. We propose a novel discriminative detection method for precisely identifying transcription factor binding sites and their functional variants from both positive and negative samples (sets of upstream sequences of both bound and unbound genes by a transcription factor) based on the genome-wide location data. Our goal is to find such discriminative substrings that best explain the location data in the sense that the substrings precisely discriminate the positive samples from the negative ones rather than finding the substrings that are simply over-represented among the positive ones. Our method consists of two steps: First, we apply a decision tree learning method to discover discriminative substrings and a hierarchical relationship among them. Second, we extract a main motif and further a second motif as a cis-regulatory variant by utilizing functional annotations. Our genome-wide experimental results on yeast Saccharomyces cerevisiae show that our method presented significantly better performances for detecting experimentally verified consensus sequences than current motif detecting methods. In addition, our method has successfully discovered second motifs of putative functional cis-regulatory variants which are associated with genes of different functional annotations, and the correctness of those variants have been verified by expression profile analyses.
Year
DOI
Venue
2006
10.1142/9781860947292_0012
Series on Advances in Bioinformatics and Computational Biology
Keywords
Field
DocType
transcription factor,gene regulation,decision tree learning,transcription factor binding site
Substring,Gene,Transcription (biology),DNA binding site,Biology,Regulation of gene expression,Bioinformatics,Discriminative model,Consensus sequence,Transcription factor
Conference
Volume
Citations 
PageRank 
3
1
0.39
References 
Authors
1
2
Name
Order
Citations
PageRank
Yuji Kawada110.73
Yasubumi Sakakibara276962.91