Title | ||
---|---|---|
Accelerating Dynamic Time Warping Clustering with a Novel Admissible Pruning Strategy |
Abstract | ||
---|---|---|
Clustering time series is a useful operation in its own right, and an important subroutine in many higher-level data mining analyses, including data editing for classifiers, summarization, and outlier detection. While it has been noted that the general superiority of Dynamic Time Warping (DTW) over Euclidean Distance for similarity search diminishes as we consider ever larger datasets, as we shall show, the same is not true for clustering. Thus, clustering time series under DTW remains a computationally challenging task. In this work, we address this lethargy in two ways. We propose a novel pruning strategy that exploits both upper and lower bounds to prune off a large fraction of the expensive distance calculations. This pruning strategy is admissible; giving us provably identical results to the brute force algorithm, but is at least an order of magnitude faster. For datasets where even this level of speedup is inadequate, we show that we can use a simple heuristic to order the unavoidable calculations in a most-useful-first ordering, thus casting the clustering as an anytime algorithm. We demonstrate the utility of our ideas with both single and multidimensional case studies in the domains of astronomy, speech physiology, medicine and entomology. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1145/2783258.2783286 | ACM Knowledge Discovery and Data Mining |
Keywords | Field | DocType |
Clustering,Time Series,Anytime Algorithms | Data mining,CURE data clustering algorithm,Dynamic time warping,Computer science,Artificial intelligence,Cluster analysis,Canopy clustering algorithm,Clustering high-dimensional data,Data stream clustering,Correlation clustering,Algorithm,Constrained clustering,Machine learning | Conference |
Citations | PageRank | References |
26 | 0.95 | 22 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Nurjahan Begum | 1 | 56 | 2.24 |
Liudmila Ulanova | 2 | 87 | 6.23 |
Jun Wang | 3 | 144 | 15.26 |
Eamonn J. Keogh | 4 | 11859 | 645.93 |