Title
ProSNet: integrating homology with molecular networks for protein function prediction.
Abstract
Automated annotation of protein function has become a critical task in the post-genomic era. Network-based approaches and homology-based approaches have been widely used and recently tested in large-scale community-wide assessment experiments. It is natural to integrate network data with homology information to further improve the predictive performance. However, integrating these two heterogeneous, high-dimensional and noisy datasets is non-trivial. In this work, we introduce a novel protein function prediction algorithm ProSNet. An integrated heterogeneous network is first built to include molecular networks of multiple species and link together homologous proteins across multiple species. Based on this integrated network, a dimensionality reduction algorithm is introduced to obtain compact low-dimensional vectors to encode proteins in the network. Finally, we develop machine learning classification algorithms that take the vectors as input and make predictions by transferring annotations both within each species and across different species. Extensive experiments on five major species demonstrate that our integration of homology with molecular networks substantially improves the predictive performance over existing approaches.
Year
DOI
Venue
2017
10.1142/9789813207813_0004
Biocomputing-Pacific Symposium on Biocomputing
Keywords
Field
DocType
protein function prediction,homology,molecular networks,dimensionality reduction,data integration
Data integration,Dimensionality reduction,Biology,Homology (biology),Computational biology,Molecular Sequence Annotation,Genetics,Protein function prediction
Conference
Volume
ISSN
Citations 
22
2335-6936
1
PageRank 
References 
Authors
0.41
4
3
Name
Order
Citations
PageRank
Sheng Wang1498.26
Meng Qu2116337.34
Peng, Jian343050.07