Title
The inference of protein-protein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships.
Abstract
The prediction of protein-protein interactions is currently an important issue in bioinformatics. The mirror tree method uses evolutionary information to predict protein-protein interactions. However, it has been recognized that predictions by the mirror tree method lead to many false positives. The incentive of our study was to solve this problem by improving the method of extracting the co-evolutionary information regarding the protein pairs.We developed a novel method to predict protein-protein interactions from co-evolutionary information in the framework of the mirror tree method. The originality is the use of the projection operator to exclude the information about the phylogenetic relationships among the source organisms from the distance matrix. Each distance matrix was transformed into a vector for the operation. The vector is referred to as a 'phylogenetic vector'. We have proposed three ways to extract the phylogenetic information: (1) using the 16S rRNA from the same source organisms as the proteins under consideration, (2) averaging the phylogenetic vectors and (3) analyzing the principal components of the phylogenetic vectors. We examined the performance of the proposed methods to predict interacting protein pairs from Escherichia coli, using experimentally verified data. Our method was successful, and it drastically reduced the number of false positives in the prediction.The R script for the prediction of protein-protein interactions reported in this manuscript is available at http://timpani.genome.ad.jp/~proj/sato@kuicr.kyoto-u.ac.jpThe information is also available at the same site as the R script.
Year
DOI
Venue
2005
10.1093/bioinformatics/bti564
Bioinformatics
Keywords
Field
DocType
distance matrix,false positive,source organism,phylogenetic relationship,interacting protein pair,r script,protein interaction,mirror tree method,co-evolutionary information,co-evolutionary analysis,phylogenetic vector,protein pair,protein protein interaction,projection operator,escherichia coli,principal component
Sequence alignment,Data mining,Phylogenetic tree,Inference,Computer science,Projection (linear algebra),Distance matrix,Bioinformatics,Phylogenetics,Principal component analysis,False positive paradox
Journal
Volume
Issue
ISSN
21
17
1367-4803
Citations 
PageRank 
References 
22
1.39
7
Authors
4
Name
Order
Citations
PageRank
Tetsuya Sato1222.74
Yoshihiro Yamanishi2126883.44
Minoru Kanehisa34429707.80
Hiroyuki Toh4221.39