Title
Support Vector Machine Classification of Probability Models and Peptide Features for Improved Peptide Identification from Shotgun Proteomics
Abstract
Mass spectrometry (MS)-based proteomics is a powerful and popular high-throughput process for characterizing the global protein content of a sample. In shotgun proteomics, typically proteins are digested into fragments (peptides) prior to mass analysis, and the presence of a protein in inferred from the identification of its constituent peptides. Thus, accurate proteome characterization is dependent upon the accuracy of this peptide identification step. Database search routines generate predicted spectra for all peptides derived from the known genome information, and thus, identify a peptide by 'matching' an experimental to a predicted spectrum. However, due to many problems, such as incomplete fragmentation, this process results in a large number of false positives. We present a new scoring algorithm that integrates probabilistic database scoring metrics (from the MSPolygraph program) with physico-chemical properties in a support vector machine (SVM). We demonstrate that this peptide identification classifier SVM (PICS) score is not only more accurate than the single best database scoring metric, but is also significantly more accurate than models derived using a linear discriminant analysis, decision tree, or artificial neural network.
Year
DOI
Venue
2007
10.1109/ICMLA.2007.103
Cincinnati, OH
Keywords
Field
DocType
scoring algorithm,spectrum,classification,probability,shotgun proteomics,support vector machine,artificial neural network,mass spectroscopy,database search,ecology,high throughput,support vector machines,metrics,algorithms,decision tree,false positive,mass spectrometry,probabilistic database,proteins
Pattern recognition,Proteomics,Computer science,Scoring algorithm,Database search engine,Support vector machine,Proteome,Artificial intelligence,Linear discriminant analysis,Shotgun proteomics,Machine learning,Probabilistic database
Conference
ISBN
Citations 
PageRank 
0-7695-3069-9
1
0.38
References 
Authors
11
3
Name
Order
Citations
PageRank
Bobbie-jo M. Webb-robertson1939.14
Christopher Oehmen27711.11
William R. Cannon36910.68