Abstract | ||
---|---|---|
The huge volume of gene expression data produced by microarrays and other high-throughput techniques has encouraged the development of new computational techniques to evaluate the data and to formulate new biological hypotheses. To this purpose, co-clustering techniques are widely used: these identify groups of genes that show similar activity patterns under a specific subset of the experimental conditions by measuring the similarity in expression within these groups. However, in many applications, distance metrics based only on expression levels fail in capturing biologically meaningful clusters. We propose a methodology in which a standard expression-based co-clustering algorithm is enhanced by sets of constraints which take into account the similarity/dissimilarity (inferred by the Gene Ontology, GO) between pairs of genes. Our approach minimizes the intervention of the analyst within the co-clustering process. It provides meaningful co-clusters whose discovery and interpretation is increased by embedding GO annotations. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/978-3-642-10291-2_43 | AI*IA |
Keywords | Field | DocType |
gene ontology,new biological hypothesis,new computational technique,gene expression data,meaningful co-clusters,biologically meaningful cluster,expression level,co-clustering process,standard expression-based co-clustering algorithm,ontology-driven co-clustering,co-clustering technique,high throughput,distance metric | Ontology,Data mining,Embedding,Gene ontology,Computer science,Gene expression,Biclustering,Transitive closure,DNA microarray | Conference |
Volume | ISSN | Citations |
5883 | 0302-9743 | 2 |
PageRank | References | Authors |
0.38 | 11 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Francesca Cordero | 1 | 63 | 13.42 |
Ruggero G. Pensa | 2 | 354 | 31.20 |
Alessia Visconti | 3 | 8 | 2.85 |
Dino Ienco | 4 | 295 | 42.01 |
Marco Botta | 5 | 284 | 41.98 |