Title
Improved homology-driven computational validation of protein-protein interactions motivated by the evolutionary gene duplication and divergence hypothesis.
Abstract
Protein-protein interaction (PPI) data sets generated by high-throughput experiments are contaminated by large numbers of erroneous PPIs. Therefore, computational methods for PPI validation are necessary to improve the quality of such data sets. Against the background of the theory that most extant PPIs arose as a consequence of gene duplication, the sensitive search for homologous PPIs, i.e. for PPIs descending from a common ancestral PPI, should be a successful strategy for PPI validation. To validate an experimentally observed PPI, we combine FASTA and PSI-BLAST to perform a sensitive sequence-based search for pairs of interacting homologous proteins within a large, integrated PPI database. A novel scoring scheme that incorporates both quality and quantity of all observed matches allows us (1) to consider also tentative paralogs and orthologs in this analysis and (2) to combine search results from more than one homology detection method. ROC curves illustrate the high efficacy of this approach and its improvement over other homology-based validation methods. New PPIs are primarily derived from preexisting PPIs and not invented de novo. Thus, the hallmark of true PPIs is the existence of homologous PPIs. The sensitive search for homologous PPIs within a large body of known PPIs is an efficient strategy to separate biologically relevant PPIs from the many spurious PPIs reported by high-throughput experiments.
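The idea of scoring a candidate PPI by both the quality and the quantity of its homologous PPIs can be illustrated with a minimal sketch. The abstract does not give the paper's scoring formula, so everything here is an assumption made for illustration: the function name `homologous_ppi_score`, the similarity values normalised to [0, 1], the use of the weaker of the two homology scores as match quality, and the noisy-OR combination over all matches.

```python
from itertools import product


def homologous_ppi_score(hits_a, hits_b, known_ppis):
    """Score a candidate PPI (A, B) by the homologous PPIs that support it.

    hits_a, hits_b: dict mapping homolog ID -> similarity in [0, 1]
                    (hypothetical normalised output of a homology search).
    known_ppis:     set of frozenset({id1, id2}) for known interactions.

    The scoring rule is an illustrative assumption, not the paper's scheme.
    """
    no_support = 1.0  # probability-like weight that no match is meaningful
    for (ha, sa), (hb, sb) in product(hits_a.items(), hits_b.items()):
        if frozenset((ha, hb)) in known_ppis:
            match_quality = min(sa, sb)  # the weaker homolog limits the match
            no_support *= 1.0 - match_quality  # more matches raise the score
    return 1.0 - no_support


# Toy data: two known interactions and the homologs found for A and B.
known = {frozenset(("P1", "Q1")), frozenset(("P2", "Q2"))}
hits_a = {"P1": 0.9, "P2": 0.5, "P3": 0.8}
hits_b = {"Q1": 0.6, "Q2": 0.5}
score = homologous_ppi_score(hits_a, hits_b, known)
```

In this sketch, a single strong match and several weak matches can both yield a high score, which is one way to let tentative homologs contribute. Hits from more than one search tool could be merged into `hits_a` and `hits_b` beforehand, for example by keeping the higher similarity per homolog; that merging step is likewise an assumption.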
Year: 2009
DOI: 10.1186/1471-2105-10-21
Venue: BMC Bioinformatics
Keywords: algorithms, bioinformatics, protein protein interaction, gene duplication, genetic variation, microarrays, roc curve, computational biology, high throughput
Field: Divergence, Protein–protein interaction, Biology, Homology (biology), Extant taxon, Bioinformatics, Genetics, Gene duplication, DNA microarray
DocType: Journal
Volume: 10
Issue: 1
ISSN: 1471-2105
Citations: 16
PageRank: 0.40
References: 26
Authors: 7
Name, Order, Citations, PageRank
Christian Frech, 1, 38, 2.05
Michael Kommenda, 2, 97, 15.58
Viktoria Dorfer, 3, 28, 2.49
Thomas Kern, 4, 18, 1.12
Helmut Hintner, 5, 16, 0.40
Johann W Bauer, 6, 16, 0.74
Kamil Onder, 7, 17, 0.76