Title
Empirical Study Of Symbolic Aggregate Approximation For Time Series Classification
Abstract
Symbolic Aggregate approximation (SAX) has been the de facto standard representation methods for knowledge discovery in time series on a number of tasks and applications. So far, very little work has been done in empirically investigating the intrinsic properties and statistical mechanics in SAX words. In this paper, we applied several statistical measurements and proposed a new statistical measurement, i. e. information embedding cost (IEC) to analyze the statistical behaviors of the symbolic dynamics. Our experiments on the benchmark datasets and the clinical signals demonstrate that SAX can always reduce the complexity while preserving the core information embedded in the original time series with significant embedding efficiency, as well as robust to missing values and noise. Our proposed IEC score provide a priori to determine if SAX is adequate for specific dataset, which can be generalized to evaluate other symbolic representations. Our work provides an analytical framework with several statistical tools to analyze, evaluate and further improve the symbolic dynamics for knowledge discovery in time series.
Year
DOI
Venue
2017
10.3233/IDA-150351
INTELLIGENT DATA ANALYSIS
Keywords
Field
DocType
SAX, knowledge discovery in time series, information embedding cost, permutation entropy, symbolic complexity
Econometrics,Symbolic aggregate approximation,Computer science,Theoretical computer science,Artificial intelligence,Empirical research,Machine learning,Time series classification
Journal
Volume
Issue
ISSN
21
1
1088-467X
Citations 
PageRank 
References 
0
0.34
9
Authors
5
Name
Order
Citations
PageRank
Wei Song111315.51
zhiguang wang2103.98
Fan Zhang321.40
Yangdong Ye411829.64
Ming Fan520.73