Title
A Two-Step Clustering For 3-D Gene Expression Data Reveals The Main Features Of The Arabidopsis Stress Response
Abstract
We developed an integrative approach for discovering gene modules, i.e. genes that are tightly correlated under several experimental conditions and applied it to a threedimensional Arabidopsis thaliana microarray dataset. The dataset consists of approximately 23000 genes responding to 9 abiotic stress conditions at 6-9 different points in time. Our approach aims at finding relatively small and dense modules lending themselves to a specific biological interpretation. In order to detect gene modules within this dataset, we employ a two-step clustering process. In the first step, a k-means clustering on one condition is performed, which is subsequently used in the second step as a seed for the clustering of the remaining conditions. To validate the significance of the obtained modules, we performed a permutation analysis and determined a null hypothesis to compare the module scores against, providing a p-value for each module. Significant modules were mapped to the Gene Ontology (GO) in order to determine the participating biological processes.As a result, we isolated modules showing high significance with respect to the p-values obtained by permutation analysis and GO mapping. In these modules we identified a number of genes that are either part of a general stress response with similar characteristics under different conditions (coherent modules), or part of a more specific stress response to a single stress condition (single response modules). We also found genes clustering within several conditions, which are, however, not part of a coherent module. These genes have a distinct temporal response under each condition. We call the modules they are contained in individual response modules (IR).
Year
DOI
Venue
2007
10.2390/biecoll-jib-2007-54
JOURNAL OF INTEGRATIVE BIOINFORMATICS
Keywords
Field
DocType
stress response
Arabidopsis,Abiotic stress,Fight-or-flight response,Data mining,Stress conditions,Computer science,Permutation,Gene expression,Gene Modules,Bioinformatics,Cluster analysis
Journal
Volume
Issue
ISSN
4
1
1613-4516
Citations 
PageRank 
References 
3
0.39
9
Authors
7
Name
Order
Citations
PageRank
Martin Strauch1532.65
Jochen Supper21068.69
Christian Spieth311912.87
Dierk Wanke4632.81
Joachim Kilian530.39
Klaus Harter6622.46
Andreas Zell71419137.58