Title
The Choice of an Appropriate Information Dissimilarity Measure for Hierarchical Clustering of River Streamflow Time Series, Based on Calculated Lyapunov Exponent and Kolmogorov Measures.
Abstract
The purpose of this paper was to choose an appropriate information dissimilarity measure for hierarchical clustering of daily streamflow discharge data, from twelve gauging stations on the Brazos River in Texas (USA), for the period 1989-2016. For that purpose, we selected and compared the average-linkage clustering hierarchical algorithm based on the compression-based dissimilarity measure (NCD), permutation distribution dissimilarity measure (PDDM), and Kolmogorov distance (KD). The algorithm was also compared with K-means clustering based on Kolmogorov complexity (KC), the highest value of Kolmogorov complexity spectrum (KCM), and the largest Lyapunov exponent (LLE). Using a dissimilarity matrix based on NCD, PDDM, and KD for daily streamflow, the agglomerative average-linkage hierarchical algorithm was applied. The key findings of this study are that: (i) The KD clustering algorithm is the most suitable among others; (ii) ANOVA analysis shows that there exist highly significant differences between mean values of four clusters, confirming that the choice of the number of clusters was suitably done; and (iii) from the clustering we found that the predictability of streamflow data of the Brazos River given by the Lyapunov time (LT), corrected for randomness by Kolmogorov time (KT) in days, lies in the interval from two to five days.
Year
DOI
Venue
2019
10.3390/e21020215
ENTROPY
Keywords
Field
DocType
streamflow time series,Brazos River,average-linkage clustering hierarchical algorithm,K-means clustering,Kolmogorov complexity-based measures,largest Lyapunov exponent,Lyapunov time,Kolmogorov time,predictability of streamflow time series
Hierarchical clustering,k-means clustering,Kolmogorov complexity,Permutation,Lyapunov time,Statistics,Cluster analysis,Lyapunov exponent,Mathematics,Randomness
Journal
Volume
Issue
ISSN
21
2
1099-4300
Citations 
PageRank 
References 
0
0.34
5
Authors
8