Title
Harnessing Diversity Towards The Reconstructing Of Large Scale Gene Regulatory Networks
Abstract
Elucidating gene regulatory network (GRN) from large scale experimental data remains a central challenge in systems biology. Recently, numerous techniques, particularly consensus driven approaches combining different algorithms, have become a potentially promising strategy to infer accurate GRNs. Here, we develop a novel consensus inference algorithm, TopkNet that can integrate multiple algorithms to infer GRNs. Comprehensive performance benchmarking on a cloud computing framework demonstrated that (i) a simple strategy to combine many algorithms does not always lead to performance improvement compared to the cost of consensus and (ii) TopkNet integrating only high-performance algorithms provide significant performance improvement compared to the best individual algorithms and community prediction. These results suggest that a priori determination of high-performance algorithms is a key to reconstruct an unknown regulatory network. Similarity among gene-expression datasets can be useful to determine potential optimal algorithms for reconstruction of unknown regulatory networks, i. e., if expression-data associated with known regulatory network is similar to that with unknown regulatory network, optimal algorithms determined for the known regulatory network can be repurposed to infer the unknown regulatory network. Based on this observation, we developed a quantitative measure of similarity among gene-expression datasets and demonstrated that, if similarity between the two expression datasets is high, TopkNet integrating algorithms that are optimal for known dataset perform well on the unknown dataset. The consensus framework, TopkNet, together with the similarity measure proposed in this study provides a powerful strategy towards harnessing the wisdom of the crowds in reconstruction of unknown regulatory networks.
Year
DOI
Venue
2013
10.1371/journal.pcbi.1003361
PLOS COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
computational biology,gene expression,algorithms,gene expression profiling,gene regulatory networks
Data mining,Similarity measure,Biology,Inference,A priori and a posteriori,Systems biology,Bioinformatics,Gene regulatory network,Benchmarking,Performance improvement,Cloud computing
Journal
Volume
Issue
ISSN
9
11
1553-734X
Citations 
PageRank 
References 
6
0.43
20
Authors
4
Name
Order
Citations
PageRank
Takeshi Hase1352.54
Samik Ghosh211812.12
Ryota Yamanaka360.43
Hiroaki Kitano43515539.37