Title
Gene set enrichment ensemble using fold change data only
Abstract
Display Omitted We propose a new approach to compute gene set enrichment measures.The new approach, Gene Set Enrichment Ensemble (GSEE), uses fold change data only.GSEE is effective to identify common enriched gene sets across different studies.It facilitates the conduction of meta-analysis across multiple studies. In a number of biological studies, the raw gene expression data are not usually published due to different causes, such as data privacy and patent rights. Instead, significant gene lists with fold change values are usually provided in most studies. However, due to variations in data sources and profiling conditions, only a small number of common significant genes could be found among similar studies. Moreover, traditional gene set based analyses that consider these genes have not taken into account the fold change values, which may be important to distinguish between the different levels of significance of the genes. Human embryonic stem cell derived cardiomyocytes (hESC-CM) is a good representative of this category. hESC-CMs, with its role as a potentially unlimited source of human heart cells for regenerative medicine, have attracted the attentions of biological and medical researchers. Because of the difficulty of acquiring data and the resulting expenses, there are only a few related hESC-CM studies and few hESC-CM gene expression data are provided. In view of these challenges, we propose a new Gene Set Enrichment Ensemble (GSEE) approach to perform gene set based analysis on individual studies based on significant up-regulated gene lists with fold change data only. Our approach provides both explicit and implicit ways to utilize the fold change data, in order to make full use of scarce data. We validate our approach with hESC-CM data and fetal heart data, respectively. Experimental results on significant gene lists from different studies illustrate the effectiveness of our proposed approach.
Year
DOI
Venue
2015
10.1016/j.jbi.2015.07.019
Journal of Biomedical Informatics
Keywords
Field
DocType
Comparative analysis,Gene Set Enrichment Analysis,Human embryonic stem cell-derived cardiomyocytes
Small number,Data mining,Gene,Computer science,Profiling (computer programming),Gene expression,Information privacy,Fold change,Human heart
Journal
Volume
Issue
ISSN
57
C
1532-0464
Citations 
PageRank 
References 
0
0.34
22
Authors
5
Name
Order
Citations
PageRank
Hai Huang100.34
shaohong zhang281.79
Wen-Jun Shen300.34
Hau-San Wong4100886.89
Dongqing Xie527724.78