Title
Computing motif correlations in proteins.
Abstract
Protein motifs are specific, conserved regions identified by comparing multiple protein sequences. These conserved regions generally play an important role in protein function and folding, for example through their binding properties or enzymatic activities. The aim of this work is to find correlations in the occurrence of protein motifs. Knowledge of motif/domain sharing should shed new light on the biological functions of proteins and offer a basis for analyzing evolution in the human genome and other genomes. The protein sequences used here are obtained from the PIR-NREF database, and the protein motifs are retrieved from the PROSITE database. We apply a data mining approach to discover correlations in the occurrence of motifs in protein sequences. The mined motif correlations can be used in evolutionary analyses and in protein structure prediction; this study discusses the latter. The mined correlations are stored and maintained in a database system, available at http://bioinfo.csie.ncu.edu.tw/ProMotif/. (C) 2003 Wiley Periodicals, Inc.
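The abstract does not say which mining algorithm is used. As an illustration only, the sketch below computes co-occurrence support and confidence for motif pairs, a common association-rule formulation that is assumed here rather than taken from the paper. The function name `motif_pair_rules`, the `min_support` threshold, and the toy protein and motif identifiers are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def motif_pair_rules(protein_motifs, min_support=0.5):
    """Return (a, b, support, confidence) for motif pairs with enough support.

    protein_motifs maps a protein ID to the list of motif IDs found in it.
    Support is the fraction of proteins containing both motifs; confidence
    of a -> b is the fraction of proteins with a that also contain b.
    This is an assumed formulation, not the paper's stated method.
    """
    n = len(protein_motifs)
    single = Counter()  # proteins containing each motif
    pair = Counter()    # proteins containing each unordered motif pair
    for motifs in protein_motifs.values():
        present = sorted(set(motifs))  # count each motif once per protein
        single.update(present)
        pair.update(combinations(present, 2))

    rules = []
    for (a, b), count in sorted(pair.items()):
        support = count / n
        if support >= min_support:
            rules.append((a, b, support, count / single[a]))
            rules.append((b, a, support, count / single[b]))
    return rules


# Hypothetical toy data; real input would pair PIR-NREF sequences
# with the PROSITE motifs detected in them.
toy = {
    "protein_1": ["M1", "M2"],
    "protein_2": ["M1", "M2", "M3"],
    "protein_3": ["M1"],
    "protein_4": ["M3"],
}
rules = motif_pair_rules(toy, min_support=0.5)
for a, b, support, confidence in rules:
    print(f"{a} -> {b}: support={support:.2f}, confidence={confidence:.2f}")
```

In this toy data only the pair M1/M2 reaches the support threshold. The two directions of the rule have different confidence because M1 also occurs without M2.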
Year: 2003
DOI: 10.1002/jcc.10332
Venue: JOURNAL OF COMPUTATIONAL CHEMISTRY
Keywords: protein, motif, structural genomics, data mining, database
Field: Protein structure prediction, Data mining, Protein domain, Structural genomics, Computational chemistry, Chemistry, Structural motif, Computational biology, PROSITE, Protein function prediction, Hypothetical protein, Multiple EM for Motif Elicitation
DocType: Journal
Volume: 24
Issue: 16
ISSN: 0192-8651
Citations: 0
PageRank: 0.34
References: 18
Authors
6
Name                 Order   Citations   PageRank
Jorng-Tzong Horng    1       541         67.78
Hsien-Da Huang       2       835         63.83
Shih-hsien Wang      3       1           1.04
Ming-You Chen        4       0           0.68
Shir-Ly Huang        5       25          4.60
Jenn-Kang Hwang      6       149         9.70