Title
Predicting protein structure classes from function predictions.
Abstract
We introduce a new approach to using the information contained in sequence-to-function prediction data in order to recognize protein template classes, a critical step in predicting protein structure. The data on which our method is based comprise probabilities of functional categories; for given query sequences these probabilities are obtained by a neural net that has previously been trained on a variety of functionally important features. On a training set of sequences we assess the relevance of individual functional categories for identifying a given structural family. Using a combination of the most relevant categories, the likelihood of a query sequence to belong to a specific family can be estimated.The performance of the method is evaluated using cross-validation. For a fixed structural family and for every sequence, a score is calculated that measures the evidence for family membership. Even for structural families of small size, family members receive significantly higher scores. For some examples, we show that the relevant functional features identified by this method are biologically meaningful. The proposed approach can be used to improve existing sequence-to-structure prediction methods.Matlab code is available on request from the authors. The data are available at http://www.mpisb.mpg.de/~sommer/Fun2Struc/
Year
DOI
Venue
2004
10.1093/bioinformatics/btg483
Bioinformatics
Keywords
Field
DocType
family member,individual functional category,predicting protein structure class,functional category,family membership,structural family,specific family,sequence-to-function prediction data,query sequence,fixed structural family,relevant functional feature,cross validation,protein structure,neural net
Sequence alignment,Training set,Data mining,MATLAB,Computer science,Software,Structure function,Artificial neural network,Protein structure
Journal
Volume
Issue
ISSN
20
5
1367-4803
Citations 
PageRank 
References 
1
0.38
4
Authors
5
Name
Order
Citations
PageRank
I Sommer140.82
Jörg Rahnenführer259438.14
F S Domingues310.38
U de Lichtenberg410.38
T. Lengauer511015.00