Title
Efficient Motif Discovery for Large-scale Time Series in Healthcare
Abstract
Analyzing time series data can reveal the temporal behavior of the underlying mechanism producing the data. Time series motifs, which are similar subsequences or frequently occurring patterns, have significant meanings for researchers especially in medical domain. With the fast growth of time series data, traditional methods for motif discovery are inefficient and not applicable to large-scale data. This work proposes an efficient Motif Discovery method for Large-scale time series (MDLats). By computing standard motifs, MDLats eliminates a majority of redundant computation in the related arts and re-uses existing information to the maximum. All the motif types and subsequences are generated for subsequent analysis and classification. Our system is implemented on a Hadoop platform and deployed in a hospital for clinical electrocardiography classification. The experiments on real-world healthcare data show that MDLats outperforms the state-of-the-art methods even in large time series.
Year
DOI
Venue
2015
10.1109/TII.2015.2411226
IEEE Transactions on Industrial Informatics
Keywords
Field
DocType
Data mining, motif, pattern discovery, time series
Data mining,Time series,Computer science,Motif (music),Computation
Journal
Volume
Issue
ISSN
PP
99
1551-3203
Citations 
PageRank 
References 
13
0.71
13
Authors
6
Name
Order
Citations
PageRank
Bo Liu114311.62
Jianqiang Li215619.55
Cheng Chen3550120.48
Wei Tan4131778.90
Qiang Chen5130.71
MengChu Zhou68989534.94