Title
DBD2BS: connecting a DNA-binding protein with its binding sites.
Abstract
By binding to short and highly conserved DNA sequences in genomes, DNA-binding proteins initiate, enhance or repress biological processes. Accurately identifying such binding sites, often represented by position weight matrices (PWMs), is an important step in understanding the control mechanisms of cells. When given coordinates of a DNA-binding domain (DBD) bound with DNA, a potential function can be used to estimate the change of binding affinity after base substitutions, where the changes can be summarized as a PWM. This technique provides an effective alternative when the chromatin immunoprecipitation data are unavailable for PWM inference. To facilitate the procedure of predicting PWMs based on protein-DNA complexes or even structures of the unbound state, the web server, DBD2BS, is presented in this study. The DBD2BS uses an atom-level knowledge-based potential function to predict PWMs characterizing the sequences to which the query DBD structure can bind. For unbound queries, a list of 1066 DBDDNA complexes (including 1813 protein chains) is compiled for use as templates for synthesizing bound structures. The DBD2BS provides users with an easy-to-use interface for visualizing the PWMs predicted based on different templates and the spatial relationships of the query protein, the DBDs and the DNAs. The DBD2BS is the first attempt to predict PWMs of DBDs from unbound structures rather than from bound ones. This approach increases the number of existing protein structures that can be exploited when analyzing protein-DNA interactions. In a recent study, the authors showed that the kernel adopted by the DBD2BS can generate PWMs consistent with those obtained from the experimental data. The use of DBD2BS to predict PWMs can be incorporated with sequence-based methods to discover binding sites in genome-wide studies. Available at: http://dbd2bs.csie.ntu.edu.tw/, http://dbd2bs. csbb. ntu. edu. tw/, and http://dbd2bs.ee.ncku.edu.tw.
Year
DOI
Venue
2012
10.1093/nar/gks564
NUCLEIC ACIDS RESEARCH
Field
DocType
Volume
Position-Specific Scoring Matrices,Binding site,Biology,Ligand (biochemistry),Binding protein,Template,Bioinformatics,Chromatin immunoprecipitation,Computational biology,Genetics,A-DNA,Protein structure
Journal
40
Issue
ISSN
Citations 
W1
0305-1048
3
PageRank 
References 
Authors
0.38
10
6
Name
Order
Citations
PageRank
Ting-Ying Chien171.13
Chih-Kang Lin281.55
Chih-Wei Lin3293.46
Yi-Zhong Weng4231.68
Chien-Yu Chen536729.24
Darby Tien-hao Chang622112.58