Title
A multi-step approach to time series analysis and gene expression clustering
Abstract
Motivation: The huge growth in gene expression data calls for the implementation of automatic tools for data processing and interpretation. Results: We present a new and comprehensive machine learning data mining framework consisting in a non-linear PCA neural network for feature extraction, and probabilistic principal surfaces combined with an agglomerative approach based on Negentropy aimed at clustering gene microarray data. The method, which provides a user-friendly visualization interface, can work on noisy data with missing points and represents an automatic procedure to get, with no a priori assumptions, the number of clusters present in the data. Cell-cycle dataset and a detailed analysis confirm the biological nature of the most significant clusters. Availability: The software described here is a subpackage part of the ASTRONEURAL package and is available upon request from the corresponding author. Contact: robtag@unisa.it Supplementary information: Supplementary data are available at Bioinformatics online.
Year
DOI
Venue
2006
10.1093/bioinformatics/btk026
Bioinformatics
Keywords
Field
DocType
feature extraction,data mining,gene expression,neural network,time series analysis,data processing,machine learning
Hierarchical clustering,Data mining,Data processing,Negentropy,Visualization,Computer science,Feature extraction,Probabilistic logic,Bioinformatics,Artificial neural network,Cluster analysis
Journal
Volume
Issue
ISSN
22
5
1367-4803
Citations 
PageRank 
References 
18
1.14
10
Authors
12
Name
Order
Citations
PageRank
Roberto Amato1394.51
Angelo Ciaramella211120.09
N. Deniskina3244.08
c del mondo4181.82
Diego Di Bernardo524422.35
Ciro Donalek6598.85
Giuseppe Longo77816.22
Giuseppe Mangano8181.82
Gennaro Miele9706.46
Giancarlo Raiconi1011815.08
Antonino Staiano1113115.83
Roberto Tagliaferri1242855.64