Title
E1DS: catalytic site prediction based on 1D signatures of concurrent conservation.
Abstract
Large-scale automatic annotation of protein sequences remains challenging in postgenomics era. E1DS is designed for annotating enzyme sequences based on a repository of 1D signatures. The employed sequence signatures are derived using a novel pattern mining approach that discovers long motifs consisted of several sequential blocks (conserved segments). Each of the sequential blocks is considerably conserved among the protein members of an EC group. Moreover, a signature includes at least three sequential blocks that are concurrently conserved, i.e. frequently observed together in sequences. In other words, a sequence signature is consisted of residues from multiple regions of the protein sequence, which echoes the observation that an enzyme catalytic site is usually constituted of residues that are largely separated in the sequence. E1DS currently contains 5421 sequence signatures that in total cover 932 4-digital EC numbers. E1DS is evaluated based on a collection of enzymes with catalytic sites annotated in Catalytic Site Atlas. When compared to the famous pattern database PROSITE, predictions based on E1DS signatures are considered more sensitive in identifying catalytic sites and the involved residues. E1DS is available at http://e1ds.ee.ncku.edu.tw/ and a mirror site can be found at http://e1ds.csbb.ntu.edu.tw/.
Year
DOI
Venue
2008
10.1093/nar/gkn324
NUCLEIC ACIDS RESEARCH
Keywords
Field
DocType
conserved sequence,protein sequence,enzyme,enzymes,amino acid sequence,internet
Conserved sequence,Annotation,Biology,Protein sequencing,Bioinformatics,Computational biology,Genetics,PROSITE,Peptide sequence
Journal
Volume
Issue
ISSN
36
SUPnan
0305-1048
Citations 
PageRank 
References 
4
0.41
14
Authors
5
Name
Order
Citations
PageRank
Ting-Ying Chien171.13
Darby Tien-hao Chang222112.58
Chien-Yu Chen336729.24
Yi-Zhong Weng4231.68
Chen-Ming Hsu5775.77