Title
Protein-Protein Interactions More Conserved Within Species Than Across Species
Abstract
Experimental high-throughput studies of protein-protein interactions are beginning to provide enough data for comprehensive computational studies. Today, about ten large data sets, each with thousands of interacting pairs, coarsely sample the interactions in fly, human, worm, and yeast. Another about 55,000 pairs of interacting proteins have been identified by more careful, detailed biochemical experiments. Most interactions are experimentally observed in prokaryotes and simple eukaryotes; very few interactions are observed in higher eukaryotes such as mammals. It is commonly assumed that pathways in mammals can be inferred through homology to model organisms, e. g. the experimental observation that two yeast proteins interact is transferred to infer that the two corresponding proteins in human also interact. Two pairs for which the interaction is conserved are often described as interologs. The goal of this investigation was a large-scale comprehensive analysis of such inferences, i.e. of the evolutionary conservation of interologs. Here, we introduced a novel score for measuring the overlap between protein-protein interaction data sets. This measure appeared to reflect the overall quality of the data and was the basis for our two surprising results from our large-scale analysis. Firstly, homology-based inferences of physical protein-protein interactions appeared far less successful than expected. In fact, such inferences were accurate only for extremely high levels of sequence similarity. Secondly, and most surprisingly, the identification of interacting partners through sequence similarity was significantly more reliable for protein pairs within the same organism than for pairs between species. Our analysis underlined that the discrepancies between different datasets are large, even when using the same type of experiment on the same organism. This reality considerably constrains the power of homology-based transfer of interactions. In particular, the experimental probing of interactions in distant model organisms has to be undertaken with some caution. More comprehensive images of protein-protein networks will require the combination of many high-throughput methods, including in silico inferences and predictions.
Year
DOI
Venue
2006
10.1371/journal.pcbi.0020079
PLOS COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
protein binding,protein protein interaction,high throughput,conserved sequence,binding sites,evolutionary conservation,proteins
Sequence alignment,Protein–protein interaction prediction,Conserved sequence,Protein–protein interaction,Biology,Homology (biology),Bioinformatics,Model organism,Genetics,Drosophila melanogaster,In silico
Journal
Volume
Issue
ISSN
2
7
1553-7358
Citations 
PageRank 
References 
21
1.16
8
Authors
2
Name
Order
Citations
PageRank
Sven Mika11068.59
Burkhard Rost279588.14