Title
Assessing the functional structure of genomic data.
Abstract
The availability of genome-scale data has enabled an abundance of novel analysis techniques for investigating a variety of systems-level biological relationships. As thousands of such datasets become available, they provide an opportunity to study high-level associations between cellular pathways and processes. This also allows the exploration of shared functional enrichments between diverse biological datasets, and it serves to direct experimenters to areas of low data coverage or with high probability of new discoveries. We analyze the functional structure of Saccharomyces cerevisiae datasets from over 950 publications in the context of over 140 biological processes. This includes a coverage analysis of biological processes given current high-throughput data, a data-driven map of associations between processes, and a measure of similar functional activity between genome-scale datasets. This uncovers subtle gene expression similarities in three otherwise disparate microarray datasets due to a shared strain background. We also provide several means of predicting areas of yeast biology likely to benefit from additional high-throughput experimental screens. Predictions are provided in supplementary tables; software and additional data are available from the authors by request. Supplementary data are available at Bioinformatics online.
Year: 2008
DOI: 10.1093/bioinformatics/btn160
Venue: ISMB
Keywords: diverse biological datasets, supplementary data, genome-scale data, low data coverage, genomic data, saccharomyces cerevisiae datasets, biological process, current high-throughput data, functional structure, genome-scale datasets, systems-level biological relationship, additional data, gene expression, high throughput
Field: Data science, Data mining, Computer science, Software, Bioinformatics
DocType: Conference
Volume: 24
Issue: 13
ISSN: 1367-4811
Citations: 7
PageRank: 0.57
References: 11
Authors
2
Name
Order
Citations
PageRank
Curtis Huttenhower143830.18
O. G. Troyanskaya21733144.94