Title
Metagsca: A Tool For Meta-Analysis Of Gene Set Differential Coexpression
Abstract
Analyses of gene set differential coexpression may shed light on molecular mechanisms underlying phenotypes and diseases. However, differential coexpression analyses of conceptually similar individual studies are often inconsistent and underpowered to provide definitive results. Researchers can greatly benefit from an open-source application facilitating the aggregation of evidence of differential coexpression across studies and the estimation of more robust common effects. We developed Meta Gene Set Coexpression Analysis (MetaGSCA), an analytical tool to systematically assess differential coexpression of an a priori defined gene set by aggregating evidence across studies to provide a definitive result. In the kernel, a nonparametric approach that accounts for the gene-gene correlation structure is used to test whether the gene set is differentially coexpressed between two comparative conditions, from which a permutation test p-statistic is computed for each individual study. A meta-analysis is then performed to combine individual study results with one of two options: a random-intercept logistic regression model or the inverse variance method. We demonstrated MetaGSCA in case studies investigating two human diseases and identified pathways highly relevant to each disease across studies. We further applied MetaGSCA in a pan-cancer analysis with hundreds of major cellular pathways in 11 cancer types. The results indicated that a majority of the pathways identified were dysregulated in the pan-cancer scenario, many of which have been previously reported in the cancer literature. Our analysis with randomly generated gene sets showed excellent specificity, indicating that the significant pathways/gene sets identified by MetaGSCA are unlikely false positives. MetaGSCA is a user-friendly tool implemented in both forms of a Web-based application and an R package "MetaGSCA". It enables comprehensive meta-analyses of gene set differential coexpression data, with an optional module of post hoc pathway crosstalk network analysis to identify and visualize pathways having similar coexpression profiles.Author summary Analyses of gene set differential coexpression often shed light on molecular mechanisms underlying phenotypes and diseases. However, results from conceptually similar individual studies are often inconsistent and underpowered to reach definitive conclusions. We provide an open-source application facilitating the aggregation of evidence of differential coexpression across studies and the estimation of more robust common effects, with an optional module of post hoc pathway crosstalk network analysis to identify and visualize pathways having similar coexpression profiles. We established the usefulness of MetaGSCA via case studies of chronic kidney disease and non-small cell lung cancer, and applied it to a pan-cancer analysis of 11 cancer types. We further demonstrated the tool with 100 randomly generated gene sets and showed excellent specificity, indicating low false positive rates.
Year
DOI
Venue
2021
10.1371/journal.pcbi.1008976
PLOS COMPUTATIONAL BIOLOGY
DocType
Volume
Issue
Journal
17
5
ISSN
Citations 
PageRank 
1553-734X
0
0.34
References 
Authors
0
10
Name
Order
Citations
PageRank
Yan Guo112.71
Hui Yu201.35
Haocan Song300.34
Jiapeng He400.34
Olufunmilola Oyebamiji501.35
Huining Kang601.35
Jie Ping701.01
Scott Ness801.69
Yu Shyr900.34
Fei Ye10173.44