Title
Optimizing long intrinsic disorder predictors with protein evolutionary information.
Abstract
Protein existing as an ensemble of structures, called intrinsically disordered, has been shown to be responsible for a wide variety of biological functions and to be common in nature. Here we focus on improving sequence-based predictions of long (>30 amino acid residues) regions lacking specific 3-D structure by means of four new neural-network-based Predictors Of Natural Disordered Regions (PONDRs): VL3, VL3H, VL3P, and VL3E. PONDR VL3 used several features from a previously introduced PONDR VL2, but benefitted from optimized predictor models and a slightly larger (152 vs. 145) set of disordered proteins that were cleaned of mislabeling errors found in the smaller set. PONDR VL3H utilized homologues of the disordered proteins in the training stage, while PONDR VL3P used attributes derived from sequence profiles obtained by PSI-BLAST searches. The measure of accuracy was the average between accuracies on disordered and ordered protein regions. By this measure, the 30-fold cross-validation accuracies of VL3, VL3H, and VL3P were, respectively, 83.6 +/- 1.4%, 85.3 +/- 1.4%, and 85.2 +/- 1.5%. By combining VL3H and VL3P, the resulting PONDR VL3E achieved an accuracy of 86.7 +/- 1.4%. This is a significant improvement over our previous PONDRs VLXT (71.6 +/- 1.3%) and VL2 (80.9 +/- 1.4%). The new disorder predictors with the corresponding datasets are freely accessible through the web server at http://www.ist.temple.edu/disprot.
Year
DOI
Venue
2005
10.1142/S0219720005000886
J. Bioinformatics and Computational Biology
Keywords
Field
DocType
psi-blast,neural networks,pondr,prediction,evolutionary information,intrinsic protein disorder,amino acid,neural network,cross validation
Biology,Disorder predictors,Artificial intelligence,Bioinformatics,Artificial neural network,Instrumental and intrinsic value,Machine learning,Evolutionary information
Journal
Volume
Issue
ISSN
3
1
0219-7200
Citations 
PageRank 
References 
26
1.48
13
Authors
6
Name
Order
Citations
PageRank
Kang Peng117011.85
Slobodan Vucetic263756.38
Predrag Radivojac364658.89
Celeste J Brown4868.43
A. Keith Dunker546677.54
Zoran Obradovic61110137.41