Title
Cluster analysis on time series gene expression data
Abstract
Cluster analysis is frequently used to study the trend of gene expression behaviours from microarray time series data. We adopt a partitioning-based clustering algorithm for such a task. After time series are discritised into sequences, a sequential pattern mining technique is applied to find patterns as the initial clusters. Longest Common Subseries Similarity is used to measure the similarity between time series which overcomes the 'shift-effect' influence. An object is re-assigned to the cluster which has most objects within the k nearest neighbours of the object. Similarity measurements, like Pearson correlation coefficient, are used to determine the neighbours.
Year
DOI
Venue
2010
10.1504/IJBIDM.2010.030299
IJBIDM
Keywords
Field
DocType
longest common subseries similarity,time series,partitioning-based clustering algorithm,similarity measurement,microarray time series data,sequential pattern mining technique,pearson correlation coefficient,time series gene expression,gene expression behaviour,initial cluster,cluster analysis,microarrays,business intelligence,data mining,gene expression,sequential pattern mining
Data mining,Similitude,Cluster (physics),Correlation coefficient,Time series,Pearson product-moment correlation coefficient,Computer science,Cluster analysis,Sequential Pattern Mining,DNA microarray
Journal
Volume
Issue
Citations 
5
1
4
PageRank 
References 
Authors
0.43
14
3
Name
Order
Citations
PageRank
Huang-Cheng Kuo14223.87
Tsung-Lung Lee240.43
Jen-Peng Huang3576.45