Title
Template Edge Similarity Graph Clustering For Mining Multiple Gene Expression Datasets
Abstract
High throughput technologies have enabled the acquisition of large amounts of genomic data, including gene expression and RNA sequencing data for multiple species under various biological and environmental conditions. Recently, researchers have proposed methods for mining biological modules from gene co-expression networks. Biological inference from a single expression dataset suffers from spurious co-expression. Integrating multiple gene expression datasets is a promising strategy to alleviate the challenges of protein functional annotation and biological module discovery based on single gene expression data. We propose an integrative mining algorithm that constructs a template edge similarity graph whose nodes are the co-expression edges and a weighted edge connecting the two nodes corresponds to the structural similarity of the two edges across the co-expression graphs. Clustering the weighted edge similarity graph yields recurrent co-expression link clusters (modules). Experimental results on Human gene expression datasets show that the reported modules are functionally homogeneous as evident by their enrichment with biological process GO terms.
Year
DOI
Venue
2017
10.1504/IJDMB.2017.10007174
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS
Keywords
Field
DocType
co-expression networks, edge-edge similarity, biological modules
Gene,Annotation,Computer science,Inference,Gene expression,Structural similarity,Artificial intelligence,Throughput,Bioinformatics,Clustering coefficient,Cluster analysis,Machine learning
Journal
Volume
Issue
ISSN
18
1
1748-5673
Citations 
PageRank 
References 
1
0.35
0
Authors
1
Name
Order
Citations
PageRank
Saeed Salem118217.39