Title | ||
---|---|---|
Template Edge Similarity Graph Clustering For Mining Multiple Gene Expression Datasets |
Abstract | ||
---|---|---|
High-throughput technologies have enabled the acquisition of large amounts of genomic data, including gene expression and RNA sequencing data for multiple species under various biological and environmental conditions. Recently, researchers have proposed methods for mining biological modules from gene co-expression networks. However, biological inference from a single expression dataset suffers from spurious co-expression. Integrating multiple gene expression datasets is a promising strategy for alleviating the challenges that single-dataset analysis poses for protein functional annotation and biological module discovery. We propose an integrative mining algorithm that constructs a template edge similarity graph whose nodes are the co-expression edges; a weighted edge connecting two nodes represents the structural similarity of the two corresponding co-expression edges across the co-expression graphs. Clustering the weighted edge similarity graph yields recurrent co-expression link clusters (modules). Experimental results on human gene expression datasets show that the reported modules are functionally homogeneous, as evidenced by their enrichment with biological process GO terms. |
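
The construction described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not define the structural similarity measure or the clustering method, so this sketch assumes a Jaccard similarity over the sets of datasets in which each edge occurs, and it assumes that thresholding followed by connected components stands in for the clustering step. The function names and the parameters `min_support` and `threshold` are hypothetical.

```python
from itertools import combinations


def occurrence_sets(graphs):
    """Map each undirected co-expression edge to the dataset indices containing it."""
    occ = {}
    for i, edges in enumerate(graphs):
        for u, v in edges:
            occ.setdefault(tuple(sorted((u, v))), set()).add(i)
    return occ


def edge_similarity_graph(graphs, min_support=2):
    """Build a graph whose nodes are recurrent edges and whose weights are similarities.

    Assumption: similarity is the Jaccard index of the two edges' occurrence sets.
    """
    occ = {e: s for e, s in occurrence_sets(graphs).items() if len(s) >= min_support}
    sim = {}
    for e1, e2 in combinations(sorted(occ), 2):
        shared = len(occ[e1] & occ[e2])
        if shared:
            sim[(e1, e2)] = shared / len(occ[e1] | occ[e2])
    return occ, sim


def cluster_edges(nodes, sim, threshold=0.6):
    """Group edges into modules: connected components over pairs with weight >= threshold.

    Assumption: this simple union-find grouping replaces the paper's clustering step.
    """
    parent = {n: n for n in nodes}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for (e1, e2), weight in sim.items():
        if weight >= threshold:
            parent[find(e1)] = find(e2)

    modules = {}
    for n in nodes:
        modules.setdefault(find(n), set()).add(n)
    return list(modules.values())


# Toy input: four co-expression graphs, each given as a list of gene pairs.
graphs = [
    [("A", "B"), ("B", "C"), ("X", "Y")],
    [("A", "B"), ("B", "C"), ("X", "Y")],
    [("A", "B"), ("B", "C")],
    [("X", "Y"), ("P", "Q")],
]
occ, sim = edge_similarity_graph(graphs, min_support=2)
modules = cluster_edges(occ, sim, threshold=0.6)
```

In this toy input, the edges A-B and B-C occur in exactly the same three datasets and end up in one module, while X-Y forms its own module and P-Q is dropped because it occurs in only one dataset.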
Year | DOI | Venue |
---|---|---|
2017 | 10.1504/IJDMB.2017.10007174 | INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS |
Keywords | Field | DocType |
co-expression networks, edge-edge similarity, biological modules | Gene, Annotation, Computer science, Inference, Gene expression, Structural similarity, Artificial intelligence, Throughput, Bioinformatics, Clustering coefficient, Cluster analysis, Machine learning | Journal |
Volume | Issue | ISSN |
18 | 1 | 1748-5673 |
Citations | PageRank | References |
1 | 0.35 | 0 |
Authors | ||
1 |
Name | Order | Citations | PageRank |
---|---|---|---|
Saeed Salem | 1 | 182 | 17.39 |