Title
An efficient protein complex mining algorithm based on Multistage Kernel Extension.
Abstract
In recent years, many protein complex mining algorithms, such as classical clique percolation (CPM) method and markov clustering (MCL) algorithm, have developed for protein-protein interaction network. However, most of the available algorithms primarily concentrate on mining dense protein subgraphs as protein complexes, failing to take into account the inherent organizational structure within protein complexes. Thus, there is a critical need to study the possibility of mining protein complexes using the topological information hidden in edges. Moreover, the recent massive experimental analyses reveal that protein complexes have their own intrinsic organization.Inspired by the formation process of cliques of the complex social network and the centrality-lethality rule, we propose a new protein complex mining algorithm called Multistage Kernel Extension (MKE) algorithm, integrating the idea of critical proteins recognition in the Protein- Protein Interaction (PPI) network,. MKE first recognizes the nodes with high degree as the first level kernel of protein complex, and then adds the weighted best neighbour node of the first level kernel into the current kernel to form the second level kernel of the protein complex. This process is repeated, extending the current kernel to form protein complex. In the end, overlapped protein complexes are merged to form the final protein complex set.Here MKE has better accuracy compared with the classical clique percolation method and markov clustering algorithm. MKE also performs better than the classical clique percolation method both on Gene Ontology semantic similarity and co-localization enrichment and can effectively identify protein complexes with biological significance in the PPI network.
Year
DOI
Venue
2013
10.1186/1471-2105-15-S12-S7
BMC Bioinformatics
Keywords
DocType
Volume
protein-protein interaction network,best neighbor node,cliques,protein complexes,weighted best neighbor node,centrality-lethality rule,proteins,multistage kernel extension,gene ontology semantic similarity,molecular biophysics,critical proteins recognition,protein complex mining algorithm,data mining,ppi network,bioinformatics,mke algorithm,colocalization enrichment,complex social network
Conference
15 Suppl 12
Issue
ISSN
Citations 
S-12
1471-2105
0
PageRank 
References 
Authors
0.34
10
6
Name
Order
Citations
PageRank
Xianjun Shen12412.95
Yanli Zhao221.04
Yanan Li321.06
Tingting He434861.04
Jincai Yang5144.72
Xiaohua Hu62819314.15