Title
Semi-supervised network inference using simulated gene expression dynamics.
Abstract
Motivation: Inferring the structure of gene regulatory networks from high-throughput datasets remains an important and unsolved problem. Current methods are hampered by problems such as noise, low sample size, and incomplete characterizations of regulatory dynamics, leading to networks with missing and anomalous links. Integration of prior network information (e.g. from pathway databases) has the potential to improve reconstructions. Results: We developed a semi-supervised network reconstruction algorithm that enables the synthesis of information from partially known networks with time course gene expression data. We adapted partial least square-variable importance in projection (VIP) for time course data and used reference networks to simulate expression data from which null distributions of VIP scores are generated and used to estimate edge probabilities for input expression data. By using simulated dynamics to generate reference distributions, this approach incorporates previously known regulatory relationships and links the network to the dynamics to form a semi-supervised approach that discovers novel and anomalous connections. We applied this approach to data from a sleep deprivation study with KEGG pathways treated as prior networks, as well as to synthetic data from several DREAM challenges, and find that it is able to recover many of the true edges and identify errors in these networks, suggesting its ability to derive posterior networks that accurately reflect gene expression dynamics.
Year
DOI
Venue
2018
10.1093/bioinformatics/btx748
BIOINFORMATICS
Field
DocType
Volume
Data mining,Inference,Computer science,Gene expression,KEGG,Reconstruction algorithm,Synthetic data,Bioinformatics,Gene regulatory network,Sample size determination
Journal
34
Issue
ISSN
Citations 
7
1367-4803
1
PageRank 
References 
Authors
0.35
14
2
Name
Order
Citations
PageRank
Phan Nguyen121.04
Rosemary Braun278872.61